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PREFACE 


Personality psychology has changed during the past two or three 
decades. The change has been marked by a move away from theorizing 
on the grand scale and toward a greater concern with obtaining empirical 
answers to questions of manageable and modest proportions. By the same 
token, research in personality has turned increasingly to the methods and 
concepts of other areas of behavioral science; the shift from the study of 
the single case to the use of the controlled experiment is perhaps the most 
striking instance of this. 

With this movement, the lines of demarcation between personality 
psychology and other areas of psychology have become blurred. The 
student of personality is faced with considerable extension of his field 
of interest, and the usual concomitant difficulty in remaining au fait 
with the progress that is made in important technical areas. 

In this series, contributors will present both a summary of present 
knowledge on a specialized topic together with original data from their 
own investigations of the problem. This book is intended for psychologists 
and students of psychology, psychopathologists and all whose work re- 
quires knowledge of recent developments in the study of personality. 

Two original and systematic contributions to the understanding of 
schizophrenia appear in this first volume. Other papers deal with inter- 
personal perception, experimenter bias, the personality correlates of 
repression and sensitization, conditioning and personality, and processes 
in projective perception. 

Much editorial help has been given in the preparation of this volume 
by Kathleen Sylva. 


BRENDAN А. МАНЕ 


Cambridge, Massachusetts 
August 1964 
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I. Introduction 

A. Scope or THE REVIEW 

"Input dysfunction," rather than "disorder of perception or sensa- 
tion," has been chosen as the title of this review because it was felt neces- 
sary to avoid the restriction of emphasis which the older terminology 
connotes. Implicitly, if not explicitly, sensation and perception as ele- 
ments in cognitive processes have been thought of as confined to the 
functions of the direct sensory pathways and the receiving and association 
areas of the cortex. It is necessary at the present stage of our knowledge 
to bear in mind the many processes which interact before an item of 
sensory input can arise in consciousness. Not only does sensory input 
reach the cortex by the classical sensory pathways but it influences mesen- 
cephalic and diencephalic structures, via collaterals from the direct path- 
ways to modify states of awareness, attention, arousal, and emotion. In 
the case of noninformative input such as noise, the main result of this 
input may in fact be its action upon subcortical structures. In turn the 
state of the subcortical structures may, by their effects at peripheral re- 
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ceptor organs and at all levels up to the cortex, m 
sensory input. Neurophysiological studies also suggest that the cortex can 
have a profound effect on pathways through the brain stem and "indicate 
the critical role of the cortex, with its highly discriminative properties in 
the selection and transmission of sensory input" (Samuels, 1959). 

On a psychological level, owing possibly to a certain amount of rc- 
thinking which has occurred following the incorporation of information 
theory and related concepts into everyday psychological currency, the 
old idea that we cannot consider à sensory impression to be an isolated 
event has received additional emphasis. An item of sensory input must be 
considered as one in a set of possible input items having different prob- 
abilities of occurrence. It must be integrated into a pattern with respect 
to past events held in memory, it must indeed be apperceived to use an 
old-fashioned but extremely useful term. Thus the relative importance 
of different items, the degree to which they appear to the subject to 
form a subset of mutual relevance out of all other items in the sensory 
field, becomes a matter of necessary consideration. This leads our think- 
ing back to such a concept as the span of attention, or to the similar but 
not identical idea, range of cue utilization, defined by Easterbrook (1959) 
as the "total number of environmental cues in апу situation that an 
organism observes, maintains an orientation towards, responds to or as- 
sociates with a response.” From what has been said ear 
that degree of attention is under cortical and subcortical control and is 


thus subject not only to the temporary phasic emotional states of the 
organism, but also to the longer-term c 


that are manifest in mental illness. 


Because sensory input must always be thought of as having impor- 
tance to the organism insofar as it is related to, and may be integrated 
with, its past experience, the distinction among sensation, perception, 
and thinking becomes blurred. This means that in dealing with experi- 
mental studies which are Supposed to be concerned with input dy 
tion in schizophrenia, the vast literature on thought disorder must 
loom large. The boundary between the terms of reference of this review 
and another covering thought disorder would be y; 


ague and difficult to 
define. Nevertheless the attempt will be made to confine attention to 
studies which are devoted for the most part to consideration of 


into the organism, and only where relevant 10 the matter 
dealt with. The reader's indulgence must, however, 
of such clearly justifiable statements 

"Norman Cameron has defined Overinclusive thi 
ability to preserve conceptual boundaries which 
tion of irrelevant ideas, making thinking more 


odify the pattern of 


lier, it is apparent 


ortical-subcortical disarrangements 


sfunc- 
always 


input 
of how it is 
be sought in the face 
as the following: 

nking as the in- 
results in the incorpora- 
abstract and less lucid. 


INPUT DYSFUNCTION IN SCHIZOPHRENIA 3 


Payne et al. (1959) have extended the definition, regarding it as an 
attention defect; as some breakdown in a hypothetical filter mechanism 
which normally screened out those stimuli both internal and external 
which are irrelevant to a task in hand, to allow the most efficient 
processing of incoming information" (Payne, 1962). 

It is to a large extent with the subject of Payne's extended definition 
that this review will be concerned. 

Work on disturbances of sensation and perception in schizophrenia 
earlier in the century received limited attention, perhaps because of the 
findings of Kraepelin, who, using Wundtian techniques, reported little 
sensory disturbance in schizophrenia. This position was supported by 
the later pronouncements of Bleuler on the “clear sensorium" of 
schizophrenics. It is perhaps to the Gestalt and psychoanalytic schools 
that the present work on input dysfunction in schizophrenia may most 
clearly be traced; and it is in these historical sources that the origins of 
the curious patchiness of coverage may be found. As examples we find 
considerable literature on size-constancy abnormalities, with an extension 
into work on the perceived size of body parts, and little or no work on 
other constancy phenomena. There is quite a large amount of work on 
the effect of social and aversive stimuli on conditioning and performance 
of schizophrenics and a comparative dearth of studies on the effect of a 
change in the parameters of simpler sensory stimuli. 

Alongside, and perhaps as a consequence of this patchwork coverage 
of possible dysfunctions in schizophrenia, there is an incompleteness of 
reporting manifest in many studies that makes difficulties for the later 
worker who needs more detailed information to compare with his own 
results. This is particularly the case in reporting the nature of subject 
groups. This may be due to the bias of the experimenter, who, perhaps 
testing an idea that all psychotics show a certain abnormality, lumps 
together schizophrenics and depressives whether chronic or acute. 

It is with some relief that a tendency for more complete reporting 
may be noticed in more recent papers. This, of course, is a cumulative 
phenomenon and is increasingly reinforced as experimenters report 
differences between, for instance, paranoid and nonparanoid subjects, 
between chronic and acute patients, or between process or reactive sub- 
groups. While this is a step in the right direction the warnings of Shakow 
(1963) should be heeded. Dichotomies themselves lead to oversimplifica- 
tion and frequently overlap. A rating scale to measure withdrawal may 
possibly measure nearly the same thing as deterioration or elements of 
process schizophrenia; the label paranoid when based on a scale of 
nurses’ ratings may include a mixture of incoherent and paranoid 
tendencies. 
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open to the possibilities of misinterpretation is the acute- 
ing line of more or less than 2 years stay in 
hospital may be a perfectly reasonable cutting point when based on the 
probabilities of discharge from hospital (Brown, 1960). However, the 
transition from the acute to the chronic phase of the illness is part of the 
process of the disease involving an interaction between the patient and 


his environment and the determination of the boundary between the two 
states may be much less easy. 


B. SOME THEORETICAL VIEWPOINTS 


The distinction between the acute and chronic phases of the illness 


is fundamental to the interpretation of much of the literature which 
will be reviewed. Unfortunately the evidence both on the existence of a 
clear distinction between these two states and a difference in underlying 
mechanisms, should such a distinction exist, is incomplete, Two types 
of theory will be considered. One, typified, for instance, by Mednick 
(1958) and Fish (1961), Suggests a state of high drive or arousal in the 
acute phase which subsides in the chronic phase. The other type of 
theory, Suggested by, for instance, Hoffer and Osmond (1955) and 
Weckowicz (1958), is of a parasympathetic imbalance in the acute phase 
and a late change to sympathetic imbalance as the patient enters the 
chronic phase. Both types of theory draw much the same sort of dis- 
tinction between the clinic. ic in each phase, but explain them 
rather differently. 

The point of view of the latter t 
Weckowicz (1958). "Early in the illnes 
of parasympathetic activity ack mechanisms 
[see the work of Callaway, Section ILA] ancy increases. This 
signifies increased constancy! in its turn may be 
interpreted as increased aw. s i 1 € environment. The 
of reference, attach 


Stated clearly by 
in the direction 


"€, suspicious, develop ideas 
important meanings to insignificant events and eventually develop 
delusions. They react to the environment with increased anxiety, 
because the low activity of the diencephalic Sympathetic centres is 
accompained by high reactivity. However, as the disease progresses the 
function of these centres gradually shifts in the 9Pposite direction, 
namely towards sympathetic activity." This, Weckowicz Sàys, is accom- 
panied by decreased size constancy, which is indicative of а diminished 


awareness of objects in the environment. Because of the high level of 
1 Weckowicz is here discussing the results of an experiment on 


the term “increased constancy,” or alternatiy ely “ov erconstancy,” 
of a distant object is overestimated. 


size constancy and 
implies that the size 


INPUT DYSFUNCTION IN SCHIZOPHRENIA 5 


sympathetic activity, reactivity is low and the patients display "flatness 
of affect”; they also become withdrawn, “autistic,” lose interest in their 
environment, and sometimes have emotional outbursts which often 
cannot be related to external events. This suggestion of early para- 
sympathetic and later sympathetic imbalance is in accord with the view- 
point expressed by Hoffer and Osmond (1955), derived on the basis of 
biochemical theory. 

The increased awareness of the environment which is reported here 
as being characteristic of the acute phase is graphically documented by 
McGhie and Chapman (1961). Patients are swamped by a flood of 
sensory input which they are unable to control. They report: 

"Everything seems to grip my attention although 1 am not par- 
ticularly interested in anything. I am speaking to you just now but I 
can hear noises going on next door and in the corridor. I find it difficult 
to shut these out and it makes it more difficult to concentrate on what I 
am saying to you." 

"Things are coming in too fast. I lose my grip of it and get lost. 
I am attending to everything at once and as a result I do not really 
attend. to anything." 

Not only are there these complaints of inability to focus attention, 
to select between items of sensory input, but there are impressions of 
abnormalities in the quality of the sensory input. 

“Noises seem to be louder to me than they were before... . I 
notice it most with background noises." 

“Colours seem to be brighter now almost as if they are luminous." 

“The colours of things seem to be much clearer and yet at the same 
time there is something missing. The things I look at seem to be flatter 
as if you were looking just at the surface." 

Here we have reports of the lack of distinction between relevant and 
irrelevant. background material and of evident changes in distance 
constancy. 

The lack of ability to distinguish "figure" from "ground" is par- 
ticularly apparent in the perception of speech. 

“When people are talking I just get scraps of it. If it is just one 
person who is speaking that's not so bad but if the others join in then 
I can't pick it up at all. I just can't get into time with that conversation. 
It makes me feel open as if things are closing in on me and 1 have lost 
control." 

“I’m a good listener but often I'm not really taking it in. I nod my 
head and smile but it's just a lot of jumbled up words to me." 

In addition to this disturbance of ability to distinguish relevant 
from irrelevant items and to place them in their correct order, there also 
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appears to be an inability to absorb items of inforn 
presented too quickly. This is particul 
the rate of sensory input is brought 
ment. 


"When I move quickly it's a strain on me. Things go too quick for 
my mind. They get blurred and it's like being blind. It's 
seeing a picture one moment and 
and watch my feet. Everything is 
again I lose control." 

Heightened awareness of bodily sensations brings about a loss ol 
automatic Spontaneity of action; no longer can a sequence of motor acts 
be carried out without consci 


ous deliberation because the patient is 
made aware of each stage. Thus, as the 


has to be planned and execut 
Scious deliberation. 
‘self conscious’ in 


nation if they are 
arly the case where an increase in 
about by the patient's own move- 


as if you were 
another picture the next. I just stop 
all right if I stop, but if I start moving 


authors say, “each action now 
ed step by step with a great deal of con- 


The patient finds himself becoming increasingly 
an entirely literal sense.” 


“People just do things but I have to watch first to see how you do 
things. I have to think out most things first and know how to do them 
before I do them." 

This clinical material has b 
presents in a particularly graphic ma 


Б €xperimental studies, 
of the nature of sensation in the 


$ seem to be available, the main 
evidence for a decreased awareness of the full range of the subject's 


environment must rely to a large extent on experimental data, which 
will be presented later (Section ILB). 

McGhie and Chapman 
clinical material which. they Present theoretical position 
is perhaps all the more interesti 
tion which was psychoanalytic 
observations to reject the propos 
symptoms are defensive 


н : nscious conflicts and to 
suggest that the basic pathological breakdown is Particularly to be found 
in the process of perception. From this they Say that ‘ 


talt analysis of the 
Ocated by Conrad (1958), 


which he modifies in terms of Hebb's (1949) theory of perceptual 


м 
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organization, Fish reaches the conclusion that early phases of acute 
schizophrenia are due to “overactivity of the reticular system producing 
an undue diversion and disruption of central processes by the sensory 
input.” In the presence of reticular overactivity sensory events thus 
acquire greater than normal significance, thought processes are dis- 
organized, and anxiety results. When this disruption of the central 
processes by sensory input occurs frequently a “feeling of some unknown 
significance is produced by the increased impressiveness of all percep- 
tions." From these processes delusions arise. 

As the next stage, Fish proposes that parallel central reverberatory 
processes develop which are detached from sensory input and are not 
reorganized by it. This state of affairs may continue to exist in the 
chronic state after the high reticular activity which engendered it has 
ceased. It is of interest to note that Fish proposes three types of chronic 
state: In one the reticular activity does not subside and a paranoid 
illness remains in which delusions continue to be aflect laden. Second 
and third, reticular activity subsides but the cortical reorganization 
remains either in the same form as in the acute processes, or in a form 
different [rom this but less efficient than the premorbid state. Fish backs 
up his case for reticular overactivity by several pointers from physiology 
and pharmacology. For instance, noting that amphetamine overdosage 
will produce a psychosis which is clinically undistinguishable from 
paranoid schizophrenia, he points out that amphetamine produces an 
arousal response in the EEG in animals which is similar to that produced 
by reticular stimulation. This psychotomimetic effect of amphetamine 
should be borne in mind when experiments on the effect of alerting 
agents on the breadth of attention are discussed at a later stage (Section 
ILA). 

In contrast to the previous theories, in that it remains on a be- 
havioral level, is the "learning theory approach" of Mednick (1958). 
In spite of this difference the picture which results has similarities to 
many of the foregoing theories and also to that of Rosenzweig (1955). 
In both these proposals it is the erection of unrealistic defences against 
anxiety which results in psychotic behavior. In Rosenzweig's theory the 
anxiety results because of disturbances in central processes which 
normally integrate ideational and affective processes; in Mednick's theory 
the anxiety results from а combination of factors due to a high drive 
state which is said to characterize the acute stage of schizophrenia. 'This 
high drive has three results: “(а) schizophrenics more easily acquire a 
conditioned response; (b) schizophrenics show greater stimulus generali- 
zation responsiveness; (c) schizophrenics have great difficulty performing 
well in complex situations being plagued by irrelevant tangential 
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associative. responses competing with adequate modes of response.” 
Mednick goes on to elaborate his theory in terms of 
tation mechanism. An untoward event will 
high drive level, raise his anxiety level; cor 
of stimulus generalization will incre 


à reciprocal augmen- 
; in a person with an already 
sequent upon this, the range 


ase and further stimuli will acquire 
anxiety-producing qualities. This process continues until the behavior of 


the individual becomes noticeably unusual. High drive level keeps 
thoughts racing in the patient's mind and a rationalizing solution is 
evoked to reduce anxiety. Lack of discrimination js brought about by 
high generalization and any associated response, however tangential, 
may produce anxiety reduction resulting in thought disorder. The 
anxiety reduction which may be achieved by merely thinking irrelevant 


thoughts results in the patient displaying less and less emotionality and 
the chronic Stage of the illness is reached. 


These theories which have been presented 
clinical pictures, the acute stage with its excesss 
overgeneralization followed by a transition to 
diminished awareness of outside stimuli and y 


description of the clinical state roughly Corresponds in the various 
theories, the suggested underlying mechanisms tend to be somewhat 
opposed. On the one hand Weckowicz, and Hoffer and Osmond suggest 
à parasympathetic overactivity in the acute Stage followed by a transition 
to a sympathetically aroused and cortically activated picture in the chronic 
patient. On the other hand Fish and Mednick Propose high reticular 
activity or high drive in the acute state which later subsides ir Ке patient 
becomes chronic. The remainder of this chapter will be наа: пог 
so much with theories but w ‘amination of the experimental 


justification for them, and an attempt to see how far the rather divergent 
viewpoints about mechanisms mz M Э 
Onciled or verified, 


describe rather similar 
of sensory input, and 
a chronic stage with 
vithdrawal. Although the 


ith ап e 


II. The Span of Attention 


A. CONDITIONS BRINGING ABOUT ALTER 


ATIONS IN SPAN 
IN NORMAL SUBJECTS 


OF ATTENTION 
It was suggested earlier (Section I,A 


were influenced by subcortical as well 


examination of relevant experiments on normal sup; 


; А as have subcortical effects is valuable 
insofar as it may enable insight to be ра 


the interpretation of experiments with 

st relevant of these are a series of studi à 
the most releva idies ay and his 
ain drugs and 
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procedures on perceptual tasks which are interpretable in terms of 
extent of attention. 

Callaway and Thompson (1953) proposed an hypothetical mechanism 
which they then proceeded to test, by which the activity of the sym- 
pathetic nervous system and the breadth of attention are related. They 
suggested that a threat to the organism which induces a sympathetic 
discharge must in turn bring about a rise in the threshold of the input 
system which will tend to reduce the perception of the threat and hence 
reduce the sympathetic discharge (i.e., a negative feedback mechanism). 
Otherwise if increased sympathetic discharge induced a lowering of 
threshold there would be an increasing rise of sympathetic discharge to 
an intolerable limit. (This model was later modified because it was found 
that the common factor in treatments producing narrowing of attention 
was that they all produced an alert EEG.) The model was tested by the 
use of a size-constancy experiment. They suggested: "We should expect 
increased sympathetic activity to decrease that general awareness of 
gradients by which we correct our shrunken retinal image of distant 
objects. Objects would then be perceived as smaller than their true size 
and would seem to approach a size more nearly proportional to their 
retinal image size." 

The size-constancy experiment which was carried out involved the 
adjustment of the size of a projected rectangular patch of light 200 cm 
away from the subject to match the size of a rectangular card held in the 
hand, To induce sympathetic discharge a cold pressor test was employed 
in which the subject’s foot was immersed in a bucket of ice water. ‘This 
procedure evoked a reduction of size constancy in comparison to that 
obtained under nonstimulating conditions. Because the cold pressor 
procedure results in hypertension which would produce increased retinal 
blood flow in addition to sympathetic discharge, the experiment was 
repeated with inhalation of amyl nitrite as the sympathetic stimulant. In 
contrast to the previous procedure amyl nitrite induces hypotension. 
The previous result was replicated. Various possibilities which might 
result in the perceptual effect were considered in further experiments; 
explanations in terms of change of perceived distance to the adjustable 
stimulus and change of actual size of retinal image akin to atropine 
micropsia were tested and dismissed. An experimental situation in which 
there was a reduction in subsidiary cues abolished the effect earlier 
noticed so that the conclusion was reached that decreased size constancy 
after induced sympathetic adrenergic discharge resulted from reduced 
awareness of subsidiary cues which normally act to correct the diminu- 
tion in retinal size of distant objects. Callaway and Dembo (1958) and 
Callaway (1959) reported further work on the phenomenon of changes in 
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breadth of attention which t 
findings reported with amyl nitri 

of subjects and other experiments usin 
mine showed similar changes in size 
ments were eventually ab difficulty in showing 
directly whether the effect was due to decreased awareness of subsidiary 
cues or had some other ex lanation. Among others a task which these 
workers used for further experimentation was that of perceiving the 


attern of onset of lights in a binary guessing 
e.g., Hake and Hyman, 1953) that subjects’ 
inary choice game such as thi 

the actual frequency with which 
correct. Callaway et al. 
measure of narrowed 
of attention will be h 
answer. . 


the two choices are reinforced as 
procedure may be used as a 
guessing game the central focus 
by the current guess and the current 


е continually peripheral to current 
guesses, guess frequency response to answer frequency change reflects 


the influence of peripheral factors upon behaviour, In other words it can 
be used to indicate narrowed attention.” In Specific terms the experi- 


ould produce narrowing of 
1 ^5 responding more slowly than 
control subjects to a ch i i atistical pattern, 

It should perhaps be noted t that although what ap- 
parently are on the Periphery here are memory traces and not true 
sensory input, we have a position which is analogous to an information- 
conveying input situation where the immediately Present stimulus should 
be looked at as one of a set of stimuli, some of which are presently 
absent. 
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€ third card, names of 
Ts used in the first two 


€ four colors. On th 
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cards, the name of the color being different from the color used to 
print it. Using Card I the subject has to read the names of the colors, 
with Card II he has to name the colors of the dots, and with Card III he 
has to name the color of the ink used to print the names of conflicting 
colors. It is assumed that performance on Card III will be facilitated 
by narrowed attention acting to exclude the printed name of the color. 
The effect of methamphetamine is in fact to improve performance 
relatively оп Card III. 

In summary, Callaway states that the procedures which result in 
narrowed attention, the cold pressor test, and the use of amyl nitrite and 
methamphetamine, can all be said to have a sympathetic action; on the 
other hand, he also reports that an anticholinesterase nerve gas has 
similar effects. The common factor in all the procedures is that they 
result in an alert EEG. Whereas, however, the amphetamines produce 
both behavioral arousal (shown in animals by opening of the eyes, 
movements of the ears, etc.) and EEG arousal (Bradley and Elkes, 1957), 
anticholinesterases, such as phystostigmine and nerve gas, produce EEG 
arousal without behavioral arousal. Callaway's suggestion is that narrow- 
ing of attention may be the missing behavioral correlate in this case. 

Аз а corollary of these findings it can be suggested that a drug which 
produces the opposite of EEG arousal should produce “broadness of 
attention.” This possibility was investigated by Callaway and Band 
(1958) using atropine and Callaway (1959) using amobarbital. Atropine, 
while producing high-amplitude slow-wave activity in the EEG indicating 
drowsiness, is reported not to reduce behavioral alertness. It acts as a 
reticular formation inhibitor but has minimal sedative effects and is not 
considered to be a cortical inhibitor. Using the Stroop, the Gottschaldt, 
and the Luchin’s water jar tests, results were obtained with atropine 
which were consonant with the idea of “broadened attention." Amobar- 
bital was found to have a similar effect in the later study although some 
reservations in the interpretation of the results must be felt because of 
skewed distributions due to performance near to task ceilings. 

The finding of narrowing of attention in a situation likely to pro- 
duce sympathetic discharge is shown in a study by Kohn (1954). In this 
experiment subjects read stories in conditions which could be described 
as threatening, distracting, or neutral. The threatening situation was a 
stage managed presentation with high-voltage sparks, and the suggestion 
of an intense shock across the subject’s temples, and thus likely to pro- 
duce an intense emotional reaction. The distracting condition was 
similar but with a reassurance that no shock would be delivered. The 
stories were recalled later in a neutral situation. It was found that both 
the relevant and irrelevant items of the story studied under threat of 
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shock were less accurately reproduced than those studied under dis- 
tracting or neutral conditions; however, under threat 
occurred in irrelevant than relevant items. Т 
emotional arousal narrowed the field of attention at the time of the 
original learning to the relevant points of the story. Жей de 

Callaway's results with the Stroop test are directly supported "i 
Agnew and Agnew (1963). "The state of the subject's drive was шайр 
lated by the use of threat of shock and induction of feelings of failure 
if above-average performance was not achieved. The effectiveness ol A 
procedure was checked by the finding that heart rate was significantly 
increased under “high drive” conditions. It was found under these condi- 


i ‘ T nl red p k or 
tions, in contrast to “low drive conditions, without threat of shock 
failure, that performance on the Stroop test w 
attention. 


Callaway’s studies su 


a greater reduction 
hus it was suggested that 


as indicative of narrowed 


85651 that narrowing of attention occurs under 
conditions which increase sympathetic activity or produce an alert EEG. 


The Agnews’ and Kohn's studies lead to very much the same conclusion 


in that a situation threatening enough to evoke sympathetic discharge 
produced narrowing of attention, 


Bharucha-Reid (196 
the intake of stimuli 
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of its own perceptual diminution. While 
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rather dissimilar viewpoint th 
individual's perceptual field 
would appear from the experi 
аз an empirical observation № 
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also supporting the notion of 
at Snygg and Combs (1949) state the 
at the effect of the threat is to reduce the 
to the area of the perceived threat. It 
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hose teleological significance is unclear. 
Bharucha-Reid suggests four Ways in which the mechanism of the 


“behavioral anesthetic” might work. First, by an increase in activity in 
the subcortex the Sensory receiving areas may be disor, 

the effect being manifest possibly in a particular area of the cortex 
representing a single sense modality. Second, even when Sensory input is 
"allowed to be received" it may still not be elaborated for awareness by 
being received without the presence of adequate secondary reinforcing 
activity from the subcortex. A third possibility is the Suppression of 
sensory input by the cortex itself, while а fourth is the alteration of 
cortical activity by the diffuse and specific projection Systems acting with 
a particular time relationship. The trouble with these Proposals is the 
wealth of possible explanations which they offer. 


- We should perhaps 
make some attempt to delimit more accurately the Specific conditions 
a 


ganized selectively, 
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which bring about the attention-restricting behavior before relying on 
such a general notion as "stress." While stress may not be the correct 
term to use for sexual excitement it is of interest to note that Kinsey et al. 
(1953) report not only narrowing of attention in the form of inability 
10 visually perceive actions taking place in the surroundings, but also 
raising of tactual and pain thresholds during orgasm. Unfortunately, 
apart from changes in the cardiovascular system, physiological changes 
accompanying sexual excitement are poorly documented and we do not 
seem to be entitled to say whether narrowing of attention in these 
conditions is accompanied by increased sympathetic or fast EEG activity. 

Drew (1963), discussing work done during World War П (Davis, 
1946a,b; Drew, 1940), reported a gradual restriction of the field of 
attention during a prolonged fatiguing performance on a task resembling 
flying. Instruments at the extreme periphery ceased to be responded to, 
or signals were frequently missed, more so than those nearer the center, 
until eventually even the central flying panel seemed to be responded to 
one instrument at a time. Drew reports this behavior as occurring in the 
context of the subjects increased irritability and presumably therefore 
increased arousal. In discussing this work, Drew makes an important 
point. He suggests that the term "tunnel vision" which is the label given 
to this restriction of the attentive field is a misnomer and an artifact of 
experimental design. It suggests a restriction from the periphery to 
foveal areas while attention actually “becomes restricted to that part of 
the field from which stimuli are expected, whether this be central or 
peripheral. In the usual experimental design, the main task is presented 
centrally, with less important or less frequent stimuli to the periphery. 
Under these conditions, restriction is from the periphery to the centre. 
If the design is changed, however, the restriction will occur to whatever 
part of the periphery given the most frequent or most important 
signals." 

'The finding of restriction of attention to the central task is shown 
clearly in an experiment by Bahrick, Fitts, and Rankin (1952). The effect 
of high and low incentives оп performance on a central tracking task 
with a subsidiary task involving response to intermittent peripheral 
stimuli was studied. Results showed that the high incentive facilitated 
performance of the central task but in general interfered with the per- 
formance of the peripheral task. 

The picture arising from this cursory review of some of the relevant 
literature is that narrowing of attention occurs under conditions which 
may be loosely classed as arousing, that is, under conditions that might 
be considered to produce alert EEG or to evoke sympathetic discharge. 
'This is in essence the position which is put forward in an extensive re- 
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view by Easterbrook (1959), who says, "It 
generalization that, w 
in drive is associated 


is an empirically derived 
hen the direction of behaviour is constant, increase 
range of cue use... . The 
al arousal Or covert general 


aneously with overt action. 
В зр x A 7 1 
These phasic conditions which are Productive of narrowed attention ir 


normal subjects should be borne in mind when one is considering 
perceptual dysfunction in the presence of longer-term disorders of 
activation, or arousal, which may be said to characterize certain stages 
of the schizophrenic illness, 

A possibility which has been 
perhaps consider two forms of n 
instance, size-constancy 
peripheral visual cues 
narrowing concerned wi 
binary guessing game used by Call 


glossed over so far is that we should 
arrowed attention. In the case of, for 
experiments and those involving awareness of 
ith. spatial narrowing OY 
a moment in time. In the 
away, however, the concern is with 
temporal narrow atter should more correctly be termed 
a defect of memory is a matter for discussion but it is clearly connected 


with such terms as “set” in the Rodnick and Shakow (1940) sense of 
“maintenance of a set to respond,” 


B. EXPERIMENTS ON 
1. Spatial 


SCHIZOPHRENICS 


a. Size Constancy. 
between size-constancy experi 
in the span of attention i 
and cues in the peripheral field th 
object is corrected and appe 
distance. 1f, owing to narr 
perceived, this correction w 
appear smaller. 


he Phenomenon of alterations 
i Y perception of objects 
ed retina] image of an 
ize when viewed from a 
» Peripheral cues are not 
and the distant object will 


A study which directly relates the work on normal Subjects, par- 
ticularly that of Callaway (Section ILA), with Studies of size constancy 
in schizophrenia is that of Weckowicz (1958). In this experiment the 
schizophrenic patient had to adjust the length of a vertical rod, placed 
near him, so that it had the same apparent size as that of a distant 
standard. In addition to this measure of size с 


Wb: onstancy, the patient's blood 
pressure response to a mecholyl Injection (Funkenstein, Greenblatt and 
Solomon, 1949) was measured. It was found that “schizophrenic 
in whom there is a rise in blood pressure after a mecholy] injecti 


to give a higher estimation of the size of distant objects and t 


patients 
on tend 
herefore 
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nave a higher and better preserved size constancy, while those schizo- 
phrenic patients in whom there is little or no rise of blood pressure tend 
to underestimate the size of distant objects to a greater degree, thus 
having decreased size constancy.” Using the mecholyl test as a measure of 
sympathetic reactivity (Gellhorn and Miller, 1961), Weckowicz reasons, 
no doubt on the principle of “the law of initial value" (e.g., Wilder, 
1958), that sympathetic reactivity is greatest in those patients with the 
lowest sympathetic tonus and vice versa. A high level of sympathetic 
tonus in a group of schizophrenics therefore tends to be related to a 
decrease in size constancy, and is in accord with findings on normal 
subjects. Earlier Weckowicz (1957) carried out an experiment on chronic 
schizophrenics whose clinical picture, after a mean hospital stay of 7 
years 9 months, was that of hebephrenia and schizophrenia simplex, 
although some initial diagnoses were of paranoid schizophrenia. Non- 
psychotic hospitalized patients and normal controls were also tested. 
Again the task was that of matching the size of a nearby adjustable rod to 
that of a standard placed at 7.5 or 15 meters from the subject. In com- 
parison with the other groups schizophrenics underestimated the size of 
the distant object and did this increasingly so at the greater distance. 
This finding of decreased size constancy in chronic schizophrenics is 
replicated in a study by Hamilton (1963). In this experiment a variety of 
stimulus objects were used. These objects (standards) were placed 1 meter 
away from the subject; the variable was 3 meters away and under control 
of the experimenter. The positions of standard and variable are thus the 
inverse of those of Weckowicz (1957) but similar to those of Callaway and 
"Thompson (1953). An additional feature was the use of two conditions: 
one, “cueless,” in which judgments were carried out with the stimuli 
exposed in a darkened tunnel, so that peripheral distance cues were 
reduced; the other, a “cue condition" carried out in full daylight. Thus 
if the reduced size constancy of the chronic schizophrenic is due to his 
inability to perceive peripheral cues because of narrowed attention, his 
performance should not differ significantly from normal when the normal 
is deprived of distance cues. The study was carried out on chronic non- 
paranoid and chronic paranoid schizophrenics, a small group of manic 
depressives, neurotics, and normal controls. In general it was found that 
there were no differences in performance between normals and neurotics 
or among the group of psychotics. There was in general, in conditions 
where cues were present, a significant tendency for the psychotic subjects 
to display underconstancy in comparison with normals and neurotics, 
whose scores tended to approach values for perfect constancy. In the 
absence of distance cues the size judgments even of normal control sub- 
jects were liable to considerable error. In the cueless conditions only two 
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of the seven objects used as stimuli gave normal-psychotic constancy 
differences which were significant, It is important to note that these two 
objects, a penny and a packet of cigarettes of well-known brand, were 
familiar and of fixed size, while of the remainder of the stimuli, two 
diamond-shaped cards, a rod, a playing card, and a medicine bottle, 


only the playing card had a fixed size. Hamilton employed an index of 


cue responsiveness in which the size constancy in "cue" and “cueless 


conditions was compared. It is of considerable interest to note that, for 


the total psychotic group, reduced 
correlated with low cu 


mechanism suggested 


size constancy was significantly 
» 4$ would be expected from the 
of this section. 

ligence which has been shown to 
deficit shown by schizophrenics (Hunt 
ance to note that Hamilton confirms the 
negative relationship between size con- 
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€ responsiveness 


In view of the decrease in test intel 
occur as part of the Psychological 
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In some contrast to these findings is 
who investigated the size const 
phrenics and a group of norm 


an experiment by Raush (1952), 
and nonparanoid schizo- 
does not give details of 
Such statements as “the 
of hospitalization” and 
ths Prior to the experi- 
ation that was used, In 


ments” that it was a relatively chronic popul 
Raush's experiment the standard stimulus was a disk of light 20 feet away 
and in front of the subject; the variable Stimulus was д disk of light 
controlled by the subject and placed on a table 4 feet away from the 
subject and at right angles to the Standard, Trials were given in the 
dark, i.e., cueless conditions, and with room lights ful on, ie, cue 
conditions. The results of this experiment were that al] Subjects showed 
overconstancy in both conditions. However, the only Significant dif- 
ferences were those between the paranoid and DOnparanoiq schizo- 
phrenics, апа the paranoid schizophrenics and normals in the cueless, 
dark conditions, the paranoid patients showing higher constancy than 
either of the other groups. This very different picture from that obtained 
by the previous investigations is difficult to explain. The general over- 
constancy of all subjects was thought by Raush to be due to a space 
error arising because of the 90-degree angle between standard and 
variable. This angle was 5 degrees in Hamilton’s experiment аһа was of 
a similar order in the experiments by Weckowicz and Callaway, Holway 
and Boring (1941), using a similar orientation of standard anq variable 
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to those of Raush’s, obtained a larger degree of overconstancy with their 
normals than did Raush. This finding is in many ways similar to that of 
Klein (1954), who found that size judgment from memory gave consistent 
overestimations in comparison with conditions where size judgments were 
made with the standard in the peripheral field. Clearly the process of 
comparing stimuli at right angles as in Rausch's experiment involved 
memory, however immediate. Another puzzling feature is the fact that 
differences which are shown are found in the cueless condition; however, 
while the normals and nonparanoids give slightly less constancy in the 
dark than the light conditions the overconstancy of the paranoids is in- 
creased under these conditions. The main contrast between this and the 
Hamilton experiments, given that the experimental conditions brought 
about a general bias to overconstancy, is the finding that the paranoid 
schizophrenics behave differently from the nonparanoids. This finding of 
exceptional behavior by paranoids will be discussed in Section V. 

It is perhaps noteworthy that Raush's schizophrenic population was 
on the whole younger (about 25 years old) than Hamilton's (10 years) or 
Weckowiczs (36 years) Not only therefore are they likely to be less 
chronic, but according to Raush their diagnosis had not changed. 
Weckowicz noted that although some patients in his experiment had 
been diagnosed as paranoid on admission this diagnosis no longer ap- 
plied. The point that is being made here is not that age in itself affects 
size constancy—indeed Hamilton has shown no differences with age— 
but that age may reflect differences in chronicity and possible changes in 
diagnosis. Hamilton showed no changes in size constancy with chronicity 
when his group of patients was divided at more or less their 5 years stay 
in hospital. It can be suggested, however, that the change in physiological 
State which parallels an acute-to-chronic change occurs very early in 
hospitalization, and by division of his patient group at more or less than 
five years stay Hamilton would be unlikely to show any differences. 

A criticism of Raush’s study which is made by Lovinger (1956) is that 
Raush used patients who were in good contact with reality so as to 
obtain optimal conditions for size-constancy testing. This selection would 
tend to bias the results in the direction of not finding reduced size 
constancy. Lovinger therefore carried out a study in which two groups 
of schizophrenics in good and in poor contact were selected. The selec- 
tion of patients was made on the basis of ratings of contact and whether 
they lived in open or closed wards. Using illuminated disks as targets in 
line and not a right-angular arrangement of standard апа variable, 
Lovinger found that “under experimental conditions involving minimal 
distance cues schizophrenics in poor contact manifested less size con- 
stancy than either schizophrenics in good contact or normals.” It is of 
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importance to note that Lovinger used three cue conditions: (1) 
maximum, with full lighting, (2) minimal, with reduced lighting, and 
(3) no cues, darkness. In both the last and the first conditions no dif- 
ferences due to diagnosis were found, probably because even the poor- 
contact schizophrenics noticed peripheral cues under full lighting, and 
on the other hand, in darkness the normals were as handicapped as the 
schizophrenics. The possible intrusion of distance cues even for out-of- 
contact schizophrenics would seem to be a likely explanation for the 
finding (Leibowitz and Pishkin, 1961) of fully maintained size constancy 
in a group of chronic schizophrenics chosen as very withdrawn. The 
experiment was carried out under fully lit conditions, which might have 
resulted in extreme prominence of distance cues. 

Probably related in a loose fashion to the degree of reality contact 
used as a factor in Lovinger's experiment is thought disorder. Weckowicz 
and Blewett (1959) started from the notion that paranoid schizophrenics 
who show little thought disorder also show a well-maintained size con- 
stancy (Raush, 1952), while schizophrenics who show poor size constancy 
are also likely to be thought disordered. Using 40 chronic schizophrenics 
as subjects they found that there was a positive correlation between poor 
size constancy and imparied ability for abstract thinking. Weckowicz and 
Blewett interpret their result in terms of an inability to “attend 
selectively" or *to select relevant information” which is responsible for 
both size constancy and thought disorder. This interpretation of size 
constancy would seem to be at variance with the ideas of "decreased 
general awareness of the gradients by which the shrunken retinal image 
is corrected" which is said (Weckowicz, 1957) to result in decreased size 
constancy. The relationship of narrowed attention to thought disorder 
will be discussed later in the context of overinclusion. 

A study which has been included in this section because of its very 
close relationship to size constancy is that of Hartman (1962). This work 
is on the apparent size of after-images and is a rough converse of size-con- 
stancy experiments in that there is a fixed retinal image size which accord- 
ing to Emmerts law changes its phenomenal size with perceived distance. 
Quoting experiments by Edwards (1953) and Boring (1940), Hartman 
suggests that if all cues to distance are removed from the viewing situa- 
tion the size of after-image will depend on its retinal size and will thus 
remain constant. Two groups of schizophrenics and two normal groups 
were tested in this experiment; one group of patients was categorized as 
delusional and the other as nondelusional. In both cases the length of 
hospital stay was about 4 years. While the scanty reporting of actual 
details of results makes it difficult to discover actually what happened 
it was reported that “chronic delusional schizophrenics differed from 
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chronic nondelusional schizophrenics and normals in their perception 
of size changes of the after-image, i.e., they were less likely to respond 
as would be predicted on the basis of Emmerts law." No subject, normal 
or schizophrenic, was able to report after-image changes as distance varied 
when only monocular cues were available; convergence cues, however, 
did bring about a difference in after-image size but less in delusional 
patients than other subjects. These cues were only operative at short 
distances. While showing that paranoid patients are apparently different 
from the rest, the light which this single study throws on the size- 
constancy findings is unfortunately small, apparently because few dis- 
tance cues were available for even normal subjects to use. 

b. Distance Constancy. Callaway and Thompson (1953) failed to 
ance judgment was affected under sympathetic 
ld pressor test, and thus dismissed this as a 
factor responsible for changes in size constancy under stimulating condi- 
tions. Weckowicz, Sommer, and Hall (1958) carried out an out-of-doors 
distance judgment experiment which showed that chronic schizophrenics 
underestimate distance. Schizophrenics, when asked to judge successive 
equidistant intervals, had a steeper perceived gradient and reached zero 
sooner than normals. The perceptual field of the chronic schizophrenic 
is apparently lacking in depth, he lives in a “flatter world.” Size-constancy 
Scores obtained in a similar way to Weckowicz's other experiments were 
available for the patients in his experiment. It was found that poor size 
constancy was significantly correlated with poor distance judgment. This 
correlation has been criticized by Rump (1961a,b) on the basis of the 
lack of validity of the "invariance hypothesis" of a reciprocity between 
perceived size and perceived distance. Hamilton (1963), who asked his 
Subjects to make distance judgments at the end of his size-constancy 
experiments, agreed with Rump's criticism of Weckowicz et al., insofar 
as he found almost wholly negative and nonsignificant correlations be- 
tween distance judgment and size-constancy measurements carried out 
on the same apparatus in contrast to the separate determinations of 
Weckowicz et al. Hamilton’s results do, however, show, in agreement with 
Weckowicz et al., that nonparanoid schizophrenics underestimate dis- 
tance, but also that paranoid schizophrenics markedly overestimate dis- 
tance. It was also found that cue conditions benefited the distance judg- 
ments of nonpsychotic subjects more than those of the nonparanoid 
schizophrenics. It might thus appear that distance judgment deficiencies 
are possibly interpretable in terms of narrowed attention, and hence 
while full appreciation of peripheral cues 15 a necessary condition for 
good size and distance judgments the mechanisms responsible for each 
are independent. An experiment carried out by Nelson and Caldwell 
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discharge evoked by the co 
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(1962) gave no indication of an inaccuracy of depth perception in acute 
schizophrenics. This experiment was, however, carried out in a rather 
different way from those earlier and involved the positioning of a vari- 
able target to have the same apparent distance as a fixed target, an 
apparently easier task than, for instance, making a judgment of distance 
in feet (Hamilton) or the doubling or halving of standard distances 
(Weckowicz et al.). Although it was not significant, there was a tendency 
for the acute schizophrenics in Nelson and Caldwell's experiment to be 
more accurate than normals when using a post as a stimulus, possibly 
indicating some degree of overconstancy in these acute patients. Other 
objects which might have acted as affective stimuli were used, pictures 


of a woman, dog, man, or circle, but while there was some tendency for 


normals to show an order of placement of items this was not shown by 
schizophrenics, 


c. Perception of Relevant and Irrelevant Material. Venables (1963a) 
carried out an experiment concerned with the degree to which schizo- 
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the basis of each individual regression line Summarizing the trend for 
the second, third, and fourth sorting, a time for a fifth sorting, had the 
irrelevant letters not changed, was estimated, This was compared with 
the observed time for the fifth sorting. It was suggested that the degree 
to which the subject had paid attention to the irrelevant letters would 
be reflected in the disturbance shown by the difference Betwean dtre edi 
mated and observed times for the fifth sorting, Following Callaway 
(Section ILA) it was thought that narrowed ould be related 
to the degree of cortical activation shown by AS ian index of 
cortical activation the subject's threshold of f 

was measured. Strictly speaking the two-fla 
of as a measure not of cortical activity 
temporal resolution. Good temporal resolution, involves short 
cortical recovery time, is related to the refractoriness of the cortex, It is 
evident that a second stimulus may not give rise to 4 Cortical res None if 
it falls into a period of complete refractoriness after an initial А 
'The shorter this period of refractoriness the better D 
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in time appears from the experiments of Lindsley (1958), Steriade and 
Demetrescu (1962), and Schwartz and Shagass (1963) to be under the 
control of the reticular formation whose ascending activating function 
is usually thought of as increasing cortical activation. Thirty-four chronic 
schizophrenic patients were tested and their two-flash fusion thresholds 
correlated with the degree of card-sorting disturbance used as a measure 
of narrowed attention. It was found that high cortical responsivity was 
significantly correlated with narrowed attention, thus supporting Calla- 
way's findings. It had been found previously (Venables and Wing, 1962) 
that there was a relationship between two-flash threshold and a rating 
scale measure of withdrawal (Venables, 1957) such that high cortical 
activation was found in the most withdrawn patients, if these were non- 
paranoid or incoherent paranoid schizophrenics. Narrowed attention 
could thus be expected to be related to the degree of withdrawal shown 
by the subject. For the 26 nonparanoid patients in the sample this rela- 
Lion was significant. However, no relationship was evident for the remain- 
ing eight coherently paranoid patients. Similar data relating withdrawal 
and narrowness of attention showed a positive relation between the two 
measures in another group of nonparanoid patients and no relationship 
in a larger paranoid group. These results should be compared to those 
previously reported where for instance Lovinger (1956) found reduced 
Size constancy in a group of patients having reduced contact with their 
environment and hence most probably comparable to the withdrawn 
patients in Venables’ study. The lack of relation between withdrawal 
and narrowness of attention in paranoid patients also is in line with 
the findings previously reported of differences between these patients 
and other schizophrenics. The need to distinguish those paranoid pa- 
tients having coherently expressed delusions from 'those whose delusions 
ате expressed incoherently and who behave in many ways like non- 
paranoids (Venables and Wing, 1962) may account for the lack of dif- 
ference between paranoid and nonparanoid schizophrenics in Hamilton's 
(1963) study. 

The idea of stimuli as relevant or irrelevant to the task in hand is 
at the basis of studies of incidental learning. Greenberg (1953) had a 
Broup of 44 chronic schizophrenics and 44 nonpsychiatric patients per- 
form three experimental tasks: (1) “Color-Position” in which the recall 
of the colors of geometric forms was the directed or intended task and 
the recall of the position of the form the incidental or undirected task, 
{2) “Paragraphs,” in which the recall of the content of one paragraph 
from the Wechsler Memory Scale was the intentional task and the recall 
of the second paragraph the incidental task, when the instructions called 
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) ‘ © present heading Is that of Dunn (1954). T , 
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Psychiatric hospitalized со 


7 ntrols. Four ү 
to the subject, including three 


Уреѕ of scene were exposed briefly 
son” pictures showing 
- n the 
al picture having 
а tree. There were five 
“Ps of detail change, such 
Pictures. The subject 5 
er there was any difference ina pair of pictures 
or whether they were the same. It Was found that in the whipping and 
scolding Pictures the schizophrenics were Significantly Worse than normals 
in the observation of detail; this diffe i reach significance for 
the feeding and house-and-tree Pictur К is is not Dunn's interpre- 
tation these results may be viewed as supporting awe » narrowing of 
attention in chronic schizophrenics exaggerated by the emotion evoked 
by the whipping and scolding Pictures, е, however, “ул. Ай 
emotional reaction is evoked by these pic Subjects and it is 
only the combined effect of this on top aroused schizo- 
phrenic “habitus” that is sufficient to produce the Narrow} 


l = H . ng Of attention 
which brought about nonperception of detail in the Pictures, 
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2. Temporal 


a. Perception of Statistical Structure. So far we have discussed ex- 
periments interpretable in terms of narrowing of attention that is spa- 
tially orientated. In this section we turn to experiments which can be 
interpreted in terms of temporal narrowing. One of the tasks which 
Callaway (Section ILA) used to test the hypothesis of narrowing of 
attention under autonomic stimulation was that of binary choice guess- 
ing. He showed that subjects receiving methamphetamine changed the 
statistical structure of their guesses to that of the reinforced answer 
pattern more slowly than a group not receiving methamphetamine. The 
proposed mechanism of this phenomenon is that past answers and 
guesses, being in the periphery of attention, are not so readily perceived 
under conditions of narrowed attention and are not responded to. 
Changes in guess pattern do not then take place so readily. 

A somewhat similar experiment was carried out by Ashman (1959) 
on a group of schizophrenics who were described as first admissions with 
à median age of 35 years. The length of stay in this first admission was 
not described, but unless the group contained a large proportion of 
paranoid patients it might be suggested that at 35 years the group be 
considered comparatively chronic. 
divided into five subgroups, each having a 
interpolated, and final tasks. The initial and 
final tasks all consisted of 10076 reinforcement for one response, while 
the interpolated tasks had different patterns of 50:50 reinforcement. 
Ashman summarized his results as showing that "schizophrenics do not 
appear to be able to utilize prior probability learning on responding 
appropriately to later patterns." They tend to perseverate to a reinforced 
response after the cessation of a reinforcement renders the response in- 
appropriate in terms of a different pattern of events. 

Employing a more complicated statistical learning task the present 
author carried out some preliminary studies on this problem which 
were eventually abandoned because the task proved too difficult. However, 
results as far as they went throw light on the problem. The task in this 
instance was a four-choice sequential probability learning task with prob- 
abilities of 0, 17, 33, and 50% of the appearance of any of the four lights 
following the appearance of any one. The probability pattern was com- 
plete on 64 trials and four blocks of 64 trials were used with each sub- 
ject. Six nonparanoid chronic schizophrenics were tested and as a 
measure of their activation level, skin potential level was measured 
(Venables, 1963b). It was found that the three patients with low skin 
potential levels learned the task reasonably well, whereas the three 


The total group was 
different pattern of initial, 
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‚ and hence possibly narrowed atten- 
nd resorted to position habits. It 
ents with high skin potential levels 
€ possibly more sick than the other 
nce resulted simply from the fact 
task. This did not seem to be the 
ression more of consisting of short 
1, not being reinforced, eventually 
and the three patients who learned 
gressively closer to the imposed pattern. 

Closely associated with this type of experiment are those on language 
when the main concern is with its Statistical structure. Lewinsohn and 
Elwood (1961) studied the learning of word lists having different orders 
of approximation to unconstrained language as in the Miller and Selfridge 


ps of subjects were tested, acute and chronic 
iz enic patients, a 


nd normals, It was found that 
the chronic schizophrenic patients stood out from the rest in their inability 
to make use of the conte: 


xtual constraints of language. When the vocabu- 
lary level was controlled it was found that the difference between chronic 


patients and the rest disappeared; however, it could be said that vocab- 
ulary performance insofar as it is the selection of a verbal description of 
a word while keeping that word in of attention might very 


well be contaminated by the s of narrowed attention. 
The confounding effect of vocabul 
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ferior to normal subjects with simil 


nt of a sentence 
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process schi 
pattern or structure of the words or letters normal] 

b. Effects of Anchor Stimuli. Weinstein, Golds -— 
(1958) carried out an experiment on the time estimat 


ics 
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in which they confirmed the finding of Lhamon and Goldstone (1956) 
that schizophrenics tend to overestimate the duration of a clock second; 
their main concern, however, was to determine the effect of anchor 
stimuli upon performance. They found that both normals and their 
sample of chronic schizophrenics responded to the "pulling" effect of 
long- and short-interval anchors. That is, if the series of time intervals 
used in the determination of “Second Estimate Point" (SEP) started 
with a short interval the SEP was "pulled" down, whereas if the series 
Started with a long interval the SEP was "pulled" up. However, if a 
subsequent series was given with the anchors reversed it was found that 
the judgment of normal control subjects was influenced by the earlier 
Series whereas the schizophrenics tended only to respond to the effect 
of the immediate series. It appears therefore that the range of attention 
of these chronic patients was narrowed and did not involve the use of 
cues presented earlier in time. 

Salzinger (1957) studied the elfect of a prior experience of lifting 
à heavy or a light weight on the judgment of the categories of a series 
of five weights. In this experiment the performance of acute rather than 
chronic schizophrenics was compared with that of controls. It was found 
that after the prior experience of the heavy weight the schizophrenics 
shifted more than the controls. Performance did not differ after the 
light anchor. Thus it would appear that consonant with the idea of 
increased awareness of objects in the environment or broadness of atten- 
tion in the acute schizophrenic, the patients were more aware during 
their subsequent judgments of the impression left by the earlier per- 
ceived heavy weight and then judgment was affected. 

An experiment carried out by Boardman, Goldstone, Reiner, and 
Fathauer (1962) on the effect of anchor stimuli on the judgment of a 
length of one inch provides further evidence to support the earlier 
Studies. Groups of acute and chronic schizophrenics and normals were 
each divided into three subgroups. The first made size judgments using 
à modified method of limits technique starting with a neutral 1.0-inch 
anchor; the second started with a 2.0-inch anchor and after a brief rest 
followed with a series with a 0.l-inch anchor. The procedure for group 
3 was the reverse of group 2. The comparisons of greatest interest are 
those between the judgments for series with 0.l-inch anchors presented 
first or second, and with 2.0-inch anchors presented first or second. It 
Was found that neither the normal nor the chronic schizophrenic group 
showed any significant differences between their initial or final 0.1-inch 
or 2.0-inch anchor judgments, whereas the effect of the previous anchor 
Situation was marked in the judgment of the acute schizophrenics. The 
effect of the immediate anchor condition was felt when it was 0.1-inch in 
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all groups but the immediate effect of the 2.0-inch 


the acute group. It is tempting to suggest that owing to “tunnel vision 
the extremities of the 2.0-inch anchor were not perceived by the chronic 
schizophrenics but this explanation seems too facile. We Р 5 
sented with further evidence that the acute schizophrenic still has in his 
broadened attention the impression left by the previous anchor given 
while this is excluded from the attention of the chronic schizophrenic. 
The lack of effect of anchors on normal subjects in this study confirms 
the results of 


an earlier study (Boardman, Aldrick, Reiner, and Gold- 
stone, 1959) and suggests that "the concept of 


of magnitude and is stable in a norm 


anchor was felt only in 


are thus pre- 


an inch is a social standard 
al population." 
3. Discussion and Conclusions 
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focus of attention. The use of the word “meaning” has evident relevance 
when one is considering, for instance, the statistical structure of language, 
but it is just as relevant when we take the view that the meaningfulness 
of the organism's surroundings is in relation to the degree to which 
they are perceived as part of a geographic and historic whole. 

Close parallels may be drawn between the position reached here and 
that, for instance, put forward by Osgood (1957). Following the postulates 
of Hull (1943) Osgood suggests that the effect of drive is to reduce the 
number of alternative responses in a hierarchy. This comes about be- 
cause of the multiplicative relationship according to which drive increases 
the probability of alternative reactions in proportion to their initial 
habit strengths. Thus, under high drive dominant responses become more 
dominant and less probable responses in the hierarchy become even less 
probable. If this is applied (as Osgood does) to language behavior, the 
number of associates arising from the presentation of a particular word 
decreases under high drive, and thus meaningfulness in the sense used 
earlier decreases. This neo-Hullian picture puts narrowing under high 
d of behavior but the parallels between it and 
rve further attention. In the case of temporal 
tween the two pictures is largely verbal and 


drive at the response en 
perceptual narrowing dese 
narrowing the division be 
revolves around the use of terms such as associations, associative re- 
sponses, and items in the attentive field which are the result of past 
experience rather than due to present sensory input. It should be noted 
that this learning theory viewpoint is in contrast to that which is used, 
for instance, by Mednick (1958) where high drive is said to act impar- 
tially upon correct and incorrect responses and therefore tend to push 
many irrelevant responses above threshold. 

This theoretical confusion is possibly resolved by a suggestion of 
Callaway and Stone (1960) to the effect that “arousal will lead to 
(1) reduced probabilistic coding (ie. increased over-all uncertainty 
with regard to the stimulus ensemble under consideration); (2) in- 
creased filtering (as in our previous concept of narrowed attention); 
and (3) reduced size of stimulus ensemble under consideration." As 
Callaway and Stone point out, this leads to some ambiguity as it is 
impossible to tell « priori whether a peripheral stimulus will be 
excluded because of narrowness of attention or included with nearly 
the probability of the most probable stimulus because of prediction 
(1). This model does comply with the Hullian prediction used by Med- 
nick in which competing responses are elevated nearer to threshold, 
but is not very useful as an explanatory concept until the conditions for 
narrowing of attention and the reduction to equality of the remaining 
probabilities are examined by experiment. 
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Ш. Relationship to Other Groups of Experimental Studies 


A. OVERINCLUSION 


So far, apart from a brief reference in Section LA 
been made to the extensive work on 
relevance to the present review. This has been done deliberately, partly 
because it would be unnecessary duplication to review the literature on 
this topic when this has already been done by Payne, Matussek, and 
George (1959) and Payne (1961), and second because to some extent the 
findings on overinclusion provide an independent check on the position 


which has been reached as a result of the present review of an entirely 
different set of experiments. 


Although they do not proceed to 
conventional tests of overinclusion, P 
gestion already quoted that ov 


, no mention has 
overinclusion which has obvious 


test it by other than the more 
ayne et al. (1959) make the sug- 


erinclusion is an attention defect and is 
due to the breakdown of a filter mechanism that “cuts out or inhibits 


the stimuli, both internal and external, that are irrelevant to the task 


in hand” or, in our terminology, that it results from excessive broadness 
of attention. 


Payne and Friedlander (1962) and Payne (1962) summ 
ings on overinclusion as follows: 
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unusual sortings. 

The same picture of overinclusive behay 
schizophrenics is found when the SCOres [тог 
(Payne and Friedlander, 1962). 

Feinberg and Garman (1961) studying thoy 
phrenics with a modified version of the pr 


Scheerer, 1941), 
ts. On this test 
PS of normals, 


10r Occurrin 


8 only in acute 
M three test. 


5 are combined 


ght disorder in schizo- 
Ogressive Matrices test 


INPUT DYSFUNCTION IN SCHIZOPHRENIA 29 


categorized subjects’ responses in terms of their plausibility. Acute 
schizophrenics made many errors categorized as "implausible" in that 
their responses showed no geometrical similarity to the stimulus patterns. 
Chronic schizophrenics, on the other hand, made few errors of this kind 
and it was suggested that their performance should be interpreted in 
terms of “underinclusion” rather than "overinclusion." 

These studies tend to support the conclusions drawn from the 
examination of data from experiments not concerned with testing over- 
inclusion (Section П,В) that the perception of the acute schizophrenic 
is such that there is little selection of relevant and irrelevant material; it 
is in other words broad, while that of the chronic schizophrenic is 
narrow. That overinclusion occurs in schizophrenics with good prognosis 
ог those classed as reactive also fits the ideas put forward earlier in this 
section that overawareness of surroundings is to be found in "good 
premorbid" patients. 


B. STIMULUS GENERALIZATION 


Mednick's (1958) learning-theory approach to schizophrenia has as 
one of its main tenets the notion that acute schizophrenics show an 
abnormal degree of stimulus generalization. Defined as the condition 
"when a response, having been trained to a stimulus, is also elicited by 
similar stimuli,” stimulus overgeneralization is a close conceptual 
counterpart to overinclusion. If we consider that excessive width of 
attention is a disorder in which many items have equal prominence in 
onsciousness and may therefore have equal likelihood 


the organism's c 
the closeness of this concept to stimulus over- 


of eliciting a response, 


generalization is apparent. 
'The work on generalization in schizophrenics does not in fact enable 


Very strong conclusions to be drawn. Mednick (1955) used schizophrenics 
whose length of hospitalization was over 9 years on a task in which 
response to a central stimulus light was spatially generalizable to more 
peripheral lights. He found support for a hypothesis that schizophrenics 
would show greater generalization than normal. It was not, however, 
conclusive. On the basis of comparison with work on overinclusion and 
breadth of attention it should not be expected that the chronic schizo- 
phrenics used by Mednick would show generalization; however, the fact 
that half his sample were diagnosed as paranoid would possibly give rise 
to the small over-all amount of generalization that was shown. Garmezy 
(1951), however, using а sample of acute schizophrenics, showed that 
these patients tended to generalize from a training tone to other stimuli 


along the pitch dimension. 
Knopf and Fager (1959) showed some degree of spatial generaliza- 
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tion in schizophrenics, but as the chronicity of the patients was not 
designated, the value of their results is minimized. Other findings on 
generalization in schizophrenics tend to be by inference from experiments 
on verbal learning. Mednick and Devito (1958), for instance, showed that, 
in a complex learning situation with competing intralist associations 
where overgeneralization might be expected to worsen performance, 
acute schizophrenics were more impaired than chronic patients. Carson 
(1958), also working from the expectation of greater generalization in 
schizophrenics and hence greater difficulty with rote learning involving 
increasing intralist similarity, found his schizophrenic group produced 
results opposite to those predicted. In the light of the present thesis it is 
interesting to note that Carson's patients were of chronic status. 

On the basis of these rather limited results therefore it seems reason- 
able to conclude that acute schizophrenic patients may show an excess of 
stimulus generalization, while this is not the case with chronic patients. 
As these results seem to be parallel and similar to those on broadened 
attention, it does not seem wise to draw the conclusion that high stimulus 
generalization in acute patients arises because of high drive in these 
patients; rather, the reverse seems likely. Generalization on the basis of 
high drive as suggested by neo-Hullian theories may profitably be dis- 
tinguished from generalization arising from broadness of attention 
brought about by decreased arousal. The whole of this conceptual tangle 
deserves further experimentation. 


C. SENsory DEPRIVATION 


The impression gained from experiments reviewed earlier is that 
acute schizophrenic patients are under constant bombardment from an 
excess of unrestricted sensory stimulation, whereas the reverse position is 
the case with chronic patients whose narrowed attention restricts the 
amount of sensory input consciously perceived. These are changes 
brought about by the physiological state of the patient; the amount of 
sensory material available to consciousness can, however, be manipulated 
externally and by restriction might be expected to produce at least some 
of the symptoms of chronic Schizophrenia in normal subjects. As a 
corollary, restriction of sensory material might be expected to improve 
the state of the acute patient. 

In a review of anecdotal and experimental material on sensory 
deprivation in normal subjects, Rosenzweig (1960) Says, "One cannot 
help noting throughout these reports. . . . that there appear again and 
again Bleulers cardinal symptoms of schizophrenia; disturbance of 
associations, disharmony of affect, autism, ambivalence. We see disrup- 
tion of secondary thought processes, regression to the primary process, 
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impairment of reality testing and in addition such accessory symptoms as 
distortion of body image, depersonalization, delusions, hallucinations." 
That these pathological effects arise not merely from diminution of 
Sensory input per se but rather from the diminution in the meaningful- 
ness of sensory input is shown in a study by Davis, McCourt, and 
Solomon (1960), who restricted the usual patterning of sensation but 
Supplied input in the form of a randomly flashing light. Discussing the 
pathological disturbances arising from this procedure they state, Tt 
appears that what the brain needs for normal functioning is not quantity 
9r change of sensation per se, but a meaningful contact with the outside 
World." 
, Unfortunately experiments to test the other side of the coin are 
inconclusive. Gibby, Adams, and Carrera (1960) reported an enduring 
change for the better in a mixed group of patients containing some 
schizophrenics of unspecified chronicity after exposure to a sensory 
deprivation situation. Improvement was also reported by Harris (1959), 


particularly [or hebephrenics, but the change did not last beyond the 
*Xperimental situation. No change was reported by Smith, Thakurdas, 
and Lawes (1961), who used chronic patients, ог by Cleveland, Reitman, 
and Bentinck (1963). The lack of change in chronic patients might of 
Course be expected as they might be considered to be already in a state of 


Sensory deprivation. More conclusive evidence from this source must 
await experiments with better attention to diagnostic and chronicity 
Criteria, 
тх REACTION TIME 

One of the largest sections of integrated experimental work on 
Schizophrenia is that on the effect of time uncertainty in RT. This has 
been discussed by Shakow (1962, 1963). A related set of experiments are 
ainty which have recently been reviewed by 
Sutton and Zubin (1963). While this work deals with "expectancy" or 
"set" and is thus not strictly on the topic of the present review, some of 
the statements which have been made in the course of theoretical dis- 
cussions make it evident that it stone be considered at the present 
time, As an example, We find in Sutton and Zubin (1963) that "studies 
Strongly suggest that the state of madness of the patient is disproportion- 
ately affected by events which are recent in time ; however, on the other 
fani, there ЯШ say "it is also dear in the foreperiod experiments, 
and suggested in recent. work = modality experiments that long-range 
influences operating over the whole series of trials act to ‘over influence’ 
апа ‘over prepare’ the schizophrenic subject.” There thus seems super- 
ficially to be a conflict between these studies, which have mainly been 
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carried out on chronic patients, and those already reviewed, which show 
temporal narrowing in these patients. In attempting to resolve the соп» 
flict it is important to keep in mind the different processes involved in 
perception and in the development of "sets" or "expectancies." In 
perception what is always in the forefront is the present stimulus and the 
narrowness or broadness of attention must be considered in relation to 
this present anchor point. In the case of expectancy we have a state of 
the organism built on the basis of past stimuli and experiences and 
having relevance to response to a future stimulus. The range of relevant 
items in the attentive field is not anchored to a present stimulus as in 
the perceptual process. Thus in expectancy we may : 
the range of attention being narrowed to a particular subset of items in 
the past so that the subject acts with response to this “minor set" in a 
manner which is inadequate. For efficient performance attention to a 
wider range of items is required to develop a "major set" which may 
bring the range of expectancy up to date. By such an explanation as this 
the theoretical positions arising from "perceptual" and "expectancy 
experiments may be mutually reconciled, 
The leading finding of the RT ex 
the effect of regular and ir 
When, in the regul 
block of trials, nor 


have the position of 


periments on this subject is that on 
regular foreperiods on speed of reaction. 
ar procedure, the foreperiod is kept constant for a 
mal subjects appear to be able to take advantage of 
this procedure for foreperiods up to 20 sec. Chronic schizophrenics, on 


the other hand, are only able to benefit from the regularity for fore- 
periods up to 5 or 6 sec., after which they 


foreperiods are presented randomly, giy 
develop a set to respond. This differen 
were able to utilize the regularity of 
arbitrarily constructed set index (Rodnick and Shakow, 
found (Rosenthal, Lawlor, Z: 
to a rating of the patient's mental health, 
Of particular interest to the present review is 
and Singer (1945), who examined the effect of sodium 
of patients whose duration of illness w. 
be considered to be in the early chronic Stage. Without drugs these 
patients showed the same pattern of relationships as those shown by 
Rodnick and Shakow (1940). The normal subjects had faster reactions 
with regular foreperiods than with irregular foreperiods up to 20 sec, in 
length. The schizophrenics, on the other hand, showed that at about 
6-sec. foreperiod length the advantage of the regular foreperiod was lost. 
Under sodium amytal, which it will be remembered (Section ILA) is 
supposed to induce a broadening of attention and therefore in the 


1940), which was 
0) to be closely related 


a study by Huston 
amytal on a group 
аз 9 months and might therefore 
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Huston and Singer experiment should enable the subject to benefit 
from constant foreperiod length, the pattern of schizophrenic results was 
similar to that of the normals. In a later experiment Huston and Senf 
(1952) studied the effect of amytal upon the performance on a similar 
RT task with both early and chronic schizophrenics. Because only three 
preparatory foreperiods ‘were used results were not strictly comparable to 
those above. The trend, however, was with the regular procedure under 
drugs for the chronic schizophrenics to show distinct improvement in 
comparison with the undrugged condition. The improvement was not 
marked in the case of the early schizophrenics. 

Another study from Shakow’s group (Zahn, Rosenthal, and Shakow, 
1963) very nicely demonstrates the point under discussion. and has 
Particular relevance to the experiments on anchor stimuli (Section 
IL,B,2,b). Using only irregular foreperiods, these experimenters were able 
to show that when the foreperiod before the one presently involved was 
longer than the present one the chronic schizophrenic patients were more 
affected by it than normals. The conclusion is thus reached that, “instead 
of basing their pattern of preparation on their experience with the 
Series of preparation intervals as a whole, the patients seem to base it, 


Much more than do the normal sub 


the series.” 
In the case of stimulus uncertainty experiments (Sutton and Zubin, 


1963) the uncertainty is not when the stimulus is going to appear but 
What stimulus is to appear. In these studies disturbance of performance 
is shown more by changes of stimulus from light to sound than by the 
degree to which expectancy for one or another type of stimulus is built 
up. It is with this sort of influence of the parameters of the stimuli 
Within the total context that the next section will be concerned. 


jects, on the most recent event in 


IV. Stimulus Parameters as Determinants of Input Dysfunction 


f factors which are responsible for dysfunctions of 
nic patient are those concerned with the simple 
Other more complex parameters such as their 
aversive or social content when stimuli act as reinforcing agents have 
been reviewed recently by Silverman (1963) and will not be considered 
here. Data are available in amy quantity only on the effects of the 
parameters of modality and intensity and the discussion will be limited 
to these aspects. It might be expected, however, that the proposals put 
forward by Kaplan (1960) on the differential sensitization to long and 
short wavelengths by autonomic arousal would provide a further dimen- 
sion on which normal-schizophrenic differences might be expected. 


A further set о 
input in the schizophre 
parameters of stimuli. 
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А. Морлитү 


In Section ILA some proposals by Bharucha-Reid (1962) concerning 
the way in which subcortical « 


lysfunction may lead to disorganization at 
the cortex were outlined. One 


of these was that the subcortex may affect 
the function of sensory receiving areas selectively and thus possibly affect 
a single sense modality and thus disturb the normal relationship which 
exists between the different. modalities. 

There is some evidence for such 
Venables and O'Connor (1959) carried out an experiment on the 
reaction time to auditory and visual stimulation, using groups of chronic 
schizophrenic subjects divi is of rating scales measuring 
withdrawal and paranoid tendency. A control group of normal subjects 


was also tested. The finding usually reported (cf. Teichner, 1954) that 
auditory RT’s are faster than visual R 


replicated in the case of 1 ects in this experiment. It was 
also shown by those schizophrenics designated as both paranoid and not 
withdrawn. (This is a group similar to that group labeled coherent 
paranoids which behaved differently from the remainder of the patients 
in a study detailed earlier—Section ILB,l,c) The other schizophrenics 
had faster reactions to visual stimuli than to auditory stimuli. This 
result is in conformity with a much earlier study (Wells and Kelley, 
1922) in which a group of dementia praecox patients showed a tendency 
for RT to sound to be relatively longer than to light as compared with 
normals. The effect of different stimulus modalities was also shown in a 
study of the effect of background stimulation 
response of chronic schizophrenics (Venables, 
auditory background had ; 
slowing down responses in 
in the remainder of schizo 
to be speeded up. The ; 
presented in close temporal proximity 
of its excitability (Venables, 1 


a disturbance in schizophrenia. 


T’s with normal subjects was 
пе normal subj 


ackground in 
anoid subgroup, while 
Was for the response 
Solve pairs of stimuli 
!'Bgested as a measure 
d of fusion of paired 
9n of paired flashes in 
tween the two fusion 


ative dominance of опе 
modality over the other. It was found that in normal subjects all two- 


click thresholds were lower than two-flash thresholds, Among the chronic 
schizophrenics who were tested there was in genera] д smaller difference 
between the two types of thresholds and among the very deteriorated 
patients there was a reversal of the normal findings: two-click thresholds 
were longer than two-flash thresholds, There was found to be a 
significant correlation between a rating of deterioration a 


€ threshold of fusi 
The difference be 
asure of the rel 


an unpublished experiment, 
thresholds was used as a me 


nd the difference 
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between flash and click thresholds giving an impression as with the 
earlier studies of a greater disturbance in the auditory than the visual 
modality. It is an interesting parallel that simple RT to sound is 
significantly impaired under medication with LSD-25, while simple RT 
to light is not (Abramson, Jarvik, and Hirsch, 1955), and that under this 
drug normal subjects often display slowness, proverty of speech, and 
withdrawal (Rinkel, de Shon, Hyde, and Solomon, 1952), the symptoms 
that are displayed by the withdrawn deteriorated schizophrenic. 

Zahn, Rosenthal, and Lawlor (1962) studied the GSR orienting 
reactions to visual and auditory stimulation in chronic schizophrenic 
and normal subjects. The basal log conductance during the series of 
stimuli was greater with visual than with auditory stimuli in schizo- 
Phrenics while the reverse was the case with normals. This pattern was 
shown initially with the numbers of specific GSR's elicited but owing to 
the faster adaptation displayed by the normal subjects the pattern was 
Not maintained after the first eight trials. 

The same sort of alteration of the usual pattern of modality 
dominance is also shown in a study by Sutton, Hakerem, Zubin, and 
Portnoy (1961), where in addition to the effect of shifts in sensory 
Modality which was the primary study of the experiment there was a 
8eneral tendency for КТУ to be faster to light than to sound with the 
chronic schizophrenic patients who were used as abnormal subjects. 

This study and others on the same topic reviewed by Sutton and 
Zubin (1963) all show that with schizophrenics reactions to sound stimuli 
are impaired if the stimulus in the previous trial was a light, whereas 
reactions to light stimuli, however preceded, do not yield differences 
between patients and normals. This could be considered as further evi- 
dence for relative impairment of the auditory modality in schizophrenics. 

Relative disorganization of the auditory modality in schizophrenics 
is a possible explanation for the results BE the experiment of Fedio, 
Mirsky, Smith, and Parry (1961). These workers investigated the effect 
of EEG activation ироп RT following the experiment of Lansing, 
Schwartz, and Lindsley (1959). Fedio et al. used the sound of a bell to 
evoke alpha blockade and presented a buzzer as a signal for reaction 
When the EEG alpha was blocked. They compared this condition with 
Others vere the EEG alpha was not blocked or was blocked spon- 
taneously. It was fount hrenics di : 
that the speed of RT of рор уаш did not increase when alpha was 
blocked. While not the interpretation put forward by the authors, 
disorganization of the auditory modality and of the means by which the 

aroused is a possible explanation of the results. 
tudies on the effectiveness of stimuli in the different 


nd in contrast to the results from normal subjects 


auditory cortex is 
To date the 5 
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modalities and their relationship to schizophrenic pathology has not 
been given much attention. Work on this subject is not easy as other 
parameters such as intensity may produce swamping effects. However, 
attempts made, for instance, by Stevens (1955) to produce equivalent 


intensity scales for the visual and auditory modalities enable experimenta- 
tion to be carried out. 


B. INTENSITY 
A pair of studies which 
preceding section are those w 


auditory stimuli on the speed 
Tizard, 1956, 1958). In the fi 


belong partly in this and partly in the 
hich consider the intensity of visual and 
of reaction in schizophrenics (Venables and 
rst study eight intensities of stimulus light 
were used using a two-log unit range from 16 to 1500 foot-candles, It was 
found that in contrast to normal subjects, who showed a decrease in RT 
to an asymptotic value, the RT of chronic nonparanoid schizophrenics 
tended to increase after a fall to an optimal value at about 200 [oot- 
candles, and became slower with increasing intensity, In contrast this 
“paradoxical” increase in RT was not shown with a wide range of 
auditory stimulus intensities up to 115 db with 1000-cps and 200-cps 
tones and white noise. With relation to the preceding section it is 
interesting to note that in comparing the effects of visual and auditory 
stimulus intensities it was evident that whatever the change of intensities 
the auditory RT tended to be slower than the visual for the nonparanoid 
chronic schizophrenics used in these studies, 

More direct evidence on the effects of intensity 
the work of Shagass and Schwartz (1963a,b). Evok 
were examined using averaging techniques to 
ratio. Only the results of work on somato-sensor 
published so far, but in this it can be seen tl 
schizophrenics there is a greater increase in ey, 
with increase in stimulus intensity than the 
characteristic, schizophrenics are similar to 
exception of dysthymic neurotics. This excepti 
interest as it has been postulated in some theo: 
it is because the reactions of early schizophren 
anxiety states that the train of pathologic 
florid schizophrenia begins. 
. S dena пе motea that the К evoked response that 
Shagass and Schwartz are dealing 


of stimuli is given by 
ed cortical potentials 
improve signal:noise 
у stimuli have been 
hat in relatively acute 
oked potential amplitude 
re is in normals, In this 
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is thus somewhat indirect. It will, however, be of direct interest to see 
whether a nonlinear relationship between visual stimulus intensity and 
the early component of evoked response appears in schizophrenics; if it 
does not, the explanation for the paradoxical RT findings must lie in 
later cortical functioning. 


V. The State of the Patient 


At the end of Section LB two opposing viewpoints were mentioned 
concerning the relative levels of drive or arousal of acute and chronic 
schizophrenic states. Increasingly throughout the review of perceptual 
data the impression has gained ground that, if the findings on normal 
subjects (Section ILA) concerned with the motivational or arousal 
correlates of broadened or narrowed attention are applicable to the more 
al disturbances involved in schizophrenic pathol- 
ogy, we shoud expect acute patients to be characterized by low, and 
chronic patients by high, levels of arousal or drive. There are, further- 
more, suggestions that differences in physiological levels of activity may 
be found within the acute and chronic categories, depending upon the 
patient's subdiagnosis or state of withdrawal, or whether his past history 
indicates that he belongs to process or reactive, poor or good premorbid, 


categories. 

The rather glib way in which the terms arousal, drive, and sym- 
pathetic or parasympathetic activity have been used so far in discussion 
тау possibly be excused insofar as they are meant to be taken as 
shorthand summaries for concepts having wider implications. It is not 
the purpose of this chapter to present a detailed examination of these 


terms: nevertheless insofar as they have been invoked in explanation a 
Word must be said about their usage. The interrelation between the 
cortex and such subcortical mechanisms as the hypothalamus and 
reticular formation is complex, but without doing too great violence to 
the finer points of detail it may be said that stimulation of both the 
Posterior hypothalamus (which is responsible for sympathetic functions) 
and the dorsal region of the mesencephalic reticular formation causes 
cortical arousal signaled by such phenomena as desynchronization of the 
alpha rhythm. The functions of both these structures are blocked by 
barbituates and it is this principle which is involved in the sedation 
threshold test, the results of which are reviewed below. While in general 
it may be said that the anterior and posterior hypothalamus, as central 
structures mediating respectively parasympathetic and sympathetic func- 
tion, operate in a reciprocal fashion so that activity in one tends to 
promote inhibition of the other, this is not universally the case. Acute 


long-term physiologic 
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emotional disturbance, for instance, is accompanied by signs of both 
sympathetic and parasympathetic activity. Bearing this in mind in 
general we may say that conditions which produce activation of the 
central sympathetic mechanisms tend also to produce cortical arousal. 
On the other hand, drugs such as tranquilizers which shift the balance 
between the anterior and posterior hypothalamus to the parasympathetic 
side decrease the intensity of discharges from the hypothalamus and 
reticular formation to the cortex and lower the arousal level of the 
cortex. 

The reaction of schizophrenic patients to drugs may allow us to 
draw conclusions about the patient's physiological state. Thus, if the 
action of a depressant drug is to improve the patient's condition, it is 


possible to make the inference that he was formerly 


in a state of 
hyperarousal. 


One series of studies concerns the reaction of patients to such general 


depressants as sodium amytal. Lindemann (1932) was one of the first to 


show a temporary improvement in schizophrenic behavior after medica- 
tion with sodium amytal. This was shown by increased contact with the 
environment and warmth of emotion. This work was repeated more 
recently, for instance, by Fulcher, Gallagher, and Pfeiffer (1957), who 
found that the number of “lucid interv 
increased under amobarbital. Stevens 
remission of catatonic stupor under amobarbital and by taking EMG, 
EEG, and EKG recordings were able to show that remission from stupor 
was accompanied by a decrease in the level of cortical and autonomic 
activity. These studies all suggest a high degree of arousal in chronic 
deteriorated or catatonic patients. This idea is supported by the work on 
sedation threshold summarized by Shagass (1960). Amobarbit 
at а constant rate until the patient's speech is slurred or a point of 
inflection of the curve relating EEG fast activity to dose is reached. The 
amount of amobarbital required to reach this point is higher in chronic 
schizophrenics, other than those diagnosed as simple, than in normals. 
Patients diagnosed as acute schizophrenic, on the other hand, have the 
lowest of any sedation threshold reported. This low threshold in acute 
schizophrenics is confirmed in a study by Herrington and Claridge (1963). 
Anxiety states are also shown to have a high sedation threshold similar 
to that of chronic schizophrenics. The outcome of these studies is that 
there is evidence for high cerebral activity in chronic schizophrenia while 
the reverse is true of acute schizophrenic patients. This is based on the 
assumption that a patient in a state of high central 
higher dose of barbiturate to eliminate that activity 


als" among chronic schizophrenics 
and Derbyshire (1958) reported 


al is given 


activation requires a 
than one whose level 
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of activity is low. Ап alternative explanation may, however, be put 
forward in terms of a differential sensitivity to barbiturates. The study by 
Fulcher et al. (1957) mentioned above also used the drug arecoline and 
found that like amobarbital it produced "lucid intervals" in chronic 
schizophrenics. Arecoline is of interest because it has been shown to have 
pure parasympathetic (muscarinic) activity on the brain in comparison 
to amobarbital, whose action is more complex. Arecoline is one of the 
drugs like reserpine, pilocarpine, and physostigmine that inhibit condi- 
tioned avoidance in rats. It could be tentatively suggested that with- 
drawal in schizophrenics is a form of conditioned avoidance. In support 
of this it has been shown (Venables and Wing, 1962) that withdrawal in a 
nonparanoid chronic schizophrenic population is positively correlated 
with the level of skin potential shown by the subject. It has also been 
shown (Spain, 1963) that in a similar population the degree of eyelid con- 
ditioning (a paradigm for conditioned avoidance in humans?) is also 
positively related to the level of the subject's skin potential. Parasympa- 
theticomimetic drugs may therefore act to produce remissions in deterio- 
rated and withdrawn schizophrenics by decreasing conditioned avoidance 
and withdrawal. That avoidance conditioning is successful in schizophre- 
nics is shown by the number of studies which give evidence of the greater 
effectiveness of aversive reinforcers in comparison with rewards in these 
patients (cf. Silverman, 1963). m E . В 

Further support for the picture of higher activity in chronic 
schizophrenics is gained from the study of Williams (1953), who showed 
that during rest these patients showed higher basal levels of skin con- 
ductance, pulse rate, and respiration шер normal. Similar results were 
also reported by Malmo, Shagass, and Smith (1951). 

Within the chronic schizophrenic population, as has already been 
il has been shown to be related to the level of autonomic 
activity given by skin potential as an index and also to the level of 
cortical activity exemplified by two-flash fusion threshold (Venables and 
Wing, 1962). More recent unpublished data support this conclusion by a 
finding of a relationship between two-click threshold and withdrawal. 
These results, which show that the greater the degree of autonomic, visual 
cortical, or auditory cortical activation. the greater is the withdrawal, 
apply only to nonparanoid schizophrenics and paranoid schizophrenics 
who are markedly incoherent. Standing apart from these are the coherent 
paranoid schizophrenics in whom there is a tendency (unpublished data) 
for withdrawal to be related to underarousal. In. nonparanoid chronic 
o-llash threshold and skin potential as measures of 
ctivation are positively related. This pattern is 


said, withdraw: 


schizophrenics tw 
cortical and autonomic a 
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not found in normal subjects and coherent paranoid patients (Venables, 
1963b). The atypical reaction of paranoid schizophrenics has been noted 
earlier in the review, and is confirmed by these latest results. Support 
for this view is given by such disparate studies as those of Stevenson, 
Derrick, Hobbs, and Metcalfe (1957), which suggests that paranoid 
schizophrenics differ from other schizophrenics in adrenocortical response 
and phosphate excretion, and by Wertheimer and Wertheimer (1955) 
who showed that the structure of capillary loops in all but paranoid 
schizophrenics was different from normals. This viewpoint is in agree- 
ment with that put forward by Shakow (1962), who says, “thus of the 58 
measurements which we have made of a wide range of psychological 
functions on groups of normal, paranoid, and hebephrenic subjects, we 
found the paranoid to be nearer the normal in 3] instances and the 
hebephrenic nearer in only 7 instances. In 20 instances, however, the 
paranoid and hebephrenic fell on either side of the normal.” 

Among studies which are concerned with testing the difference 
between process and reactive schizophrenics one of the most convincing 
is that by Gromoll (1961), who, starting with the hypothesis that the 
process patients would be underaroused and the reactive patients over- 
aroused, using percentage time alpha as his index, in fact found the 
complete reverse, 

In this brief review of the physiological status of different. classes 
of schizophrenic patients there has been the attempt only to consider 
the level of physiological activity. It would appear from perusal of the 
literature that many false conclusions have been drawn because of 
failure to distinguish level of ongoing activity from reactivity to stimuli. 
Attempts to infer one from the other tend to be inconclusive: Although, 
for instance, Wilder (1958) argues convincingly for the 
value" by which there is an inverse linear 
of response and the level from which 
instance, Silverman, Cohen, and Shm 
curvilinear relationship betw 
the law of initial value, or i 
relationship between level and response, that w 
to be a discrepancy between what is reported 
and the state of the patient proposed in this s 


'Thus it is said that acute schizophrenic patients show large emotional 
reactions, anxiety, fear, anger, while the affect of the chronic patient 
tends to be "flat." It is against the Proposed low leve] of activity of the 
acute schizophrenic that a large emotional response may be seen, while 
because of the high existing level of activity of the chroni 
a small response may be evoked. 
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Vi. Summary 


esis which has been put forward in this review is that, 


The main th 
schizophrenics cannot 


for the purpose of considering input dysfunction, 
be taken as a whole. A distinction must be drawn between acute and 
and, within this subdivision, attention must be paid to 
hether or not he may be designated as 
process or reactive. The interrelationships between the subdivisions 
have not become clear because little attempt has been made to sort them 
out. It is possible, for instance, that with the slow-onset, processtype 
ld not expect there to be an acute stage of the 
hand, as has been hinted at in the theory 
e should not perhaps ever think of a 
chronic stage of paranoid schizophrenia. If, as has become apparent from 
the experimental work reviewed, there is a marked distinction between 
the physiological states and hence the perceptual abilities of acute and 
more experimental time can be profitably expanded 
on the analysis o£ the stage of transition from one to the other stage and 
the factors—physical, physiological, and social—which accelerate or 
retard the process. If we bear these points in mind, the experimental 
work which has been reviewed points to a general statement of the 
following kind. Chronic schizophrenic patients—and possibly included 
in this category are process patients—tend to be characterized by a state 
of restriction of the attentional field resulting from elevated states of 
sympathetic and cortical activation. Attention is restricted not only to 
the extent that peripheral sensory items contemporaneously present do 
not rise into consciousness, put also involved is the nonrecognition of 
items in memory which form part of the meaningful structure in which 
the present central item appears. Drugs and procedures which reduce 
arousal tend to improve the clinical status of these patients alongside 
(or even because of) an improvement 1n their perception of the external 
world and an increase in its meaningfulness resulting from increased 


breadth of attention. 

to the chronic patient, the acute (and possibly the 
reactive and paranoid) patient is characterized by an inability to restrict 
the range of his attention 50 that he is flooded by sensory impressions 
from all quarters. Items of all kinds have equal importance, and the 
meaningfulness of the external world tends to be lost for the opposite 
reason to that which applies with the chronic patient. The figure-ground 
relationship which allows a picture to convey information is destroyed 


equally by making the picture nearly all figure or making it nearly all 
ground. The acute patient's broadened level of attention would appear 


chronic patients 
the patient's subdiagnosis and w| 


schizophrenic we shou 
illness at all. On the other 
proposed by Fish (Section LB), w 


chronic patients, far 


In contrast 
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to arise from the low level of cortical activation or possibly the para- 
sympathetic imbalance which he displays. A wide variety of processes 
and skills seem to be affected by the attentional dysfunctions outlined, 
but nevertheless the impression should not be gained that what has been 


described is more than a segment of the total abnormality displayed by 
the schizophrenic patient. 


References 


Abramson, H. A., Jarvik, M. E., & Hirs th, M. М, LS.D. 25 X Effect on RT to auditory 
and visual stimuli, J. Psychol., 1955, 40, 39.59 

Agnew, N., & Agnew, M. Drive level effects on tasks of narrow and broad attention. 
Quart. J. exp. Psychol., 1963, 15, 58-62. 

Allison, Т. Recovery functions of somatosensory evoked responses in man. EEG clin. 
Neurophysiol., 1962, 14, 331-343. 

Ashman, G. R. Binary choice learning strategies in schizophrenics. Paper read at 
Am. Psychol. Assoc. meeting, Cincinnati, Ohio, 1959. 

Bahrick, H. P., Fitts, P. M., & Rankin, R. E. Effect of incentives upon reactions 
to peripheral stimuli. J. exp. Psychol., 1952, 44, 400-406. 

Bharucha-Reid, R. P. The internal modulating system and stress: a neurophysiological 
model. J. Gen. Psychol., 1962, 66, 147-158. 

Boardman, W. K., Aldrick, К. C., Reiner, M. L., & Goldstone, S. The effect of anchors 
upon apparent length. J. gen. Psychol., 1959, 61, 45-49. 

Boardman, W. K., Goldstone, S., Reiner, M. L., & Fathauer, W. F. Anchor effects 
spatial judgment and schizophrenia. J. abnorm. soc. Psychol., 1962, 65, 273-276. 

Boring, E. G. Size constancy and Emmerts Law. Amer. J. Psychol., 1940, 53, 293-995, 

Bradley, R. B., & Elkes, J. Effect of some drugs on the electrical activity of the brain. 
Brain, 1957, 80, 77-117. 

Brown, G. W. Length of hospital stay and schizophrenia: a review of statistical studies. 
Acta psychiat. et neurol. Scand., 1960, 35, 414-430. 

Callaway, E. The influence of amobarbital (amylobarbitone) and methamphetamine on 
the focus of attention. J. ment. Sci., 1959, 105, 382-399. 

Callaway, E, & Band, R. I. Some psychopharmacological effects of atropine. Arch. 
Neurol, Psychiat., 1958, 79, 91-109. 

Callaway, E, & Dembo, D. Narrowed attention: a psychological phenomenon that 
accompanies a certain. physiological change. Arch. Neurol. Psychiat., 1958, 79, 
74-90. 

Callaway, E., & Stone, G. Re-evaluating the focus of attention. In L. Uhr & J. С. 
Miller (Eds.), Drugs and behavior. New York: Wiley, 1960. 

Callaway, E, & Thompson, S. V. Sympathetic activity and perception. Psychosom. 
Med., 1953, 15, 443-455. 

Carson, R, C. Intralist similarity and verbal rote learning performance of schizophrenics 
and cortically damaged patients. J. abn. soc. Psychol., 1958, 57, 99-106. 

Cleveland, S. E., Reitman, E. E., & Bentinck, C. Therapeutic effectiveness of sensory 
deprivation. Arch. Gen. Psychiat., 1963, 8, 455-460. 

Conrad, K. Die beginnende Schizophrenie. Stuttgart, Germany: Thieme, 1958. 

Davis, D. R. The disorganization of behavior in fatigue. J. Neurol. Psychiat., 1946, 
9, 23-29. (a) | 

Davis, D. К. Neurotic predisposition and disorganization observed in experiments with 
the Cambridge cockpit. J. Neurol. Psychiat., 1946, 9, 119-124. (b) 


INPUT DYSFUNCTION IN SCHIZOPHRENIA 43 


Davis, J. M., McCourt, W. F., & Solomon, P. The effect of visual stimulation on 
hallucinations and other mental experiences during sensory deprivation. Amer. 
J. Psychiat., 1959-1960, 116, 889-892. 

Drew, G. C. Mental fatigue. Flying Personnel Research Committee, 1940, Rep. No. 

Drew, G. C. The study of accidents. (Presidential address to the British Psychological 
Society), Bull Brit. Psychol. Soc., 1963, 16 (52), 1-10. 

Dunn, W. L. Visual discrimination of schizophrenic subjects as a function of stimulus 
meaning. J. Pers., 1954, 23, 48-64. 

Easterbrook, J. A. The effect of emotion on cue utilization and the organization of 
behavior. Psychol. Rev., 1959, 66, 183-200. 

Edwards, W. Apparent size of after images under conditions of reduction. Amer. J. 
Psychol., 1953, 66, 449-4 

Federn, P. Ego psychology and the psychoses. London: Imago, 1953. 

Fedio, P., Mirsky, A. F., Smith, W. J., & Parry, D. Reaction time and EEG activation 
in normal and schizophrenic subjects. EEG clin. Neurophysiol., 1961, 13, 923-926. 

Feinberg, L, & Garman, E. M. Studies of thought disorder in schizophrenia. Arch. 
Gen. Psychiat., 1961, 4, 191-201. 

Fish, F. A neurophysiological theory of schizophrenia. J. ment. Sci., 1961, 107, 828-838. 

Fulcher, J. H., Gallagher, W. J- & Pfeiffer, C. C. Comparative lucid intervals after 
amobarbital CO,, and arecoline in chronic schizophrenics. Arch. Neurol. Psychiat., 


1957, 78, 392-395. 

Funkenstein, D. H., Greenblatt, M., & Solomon, Н. С. Psychophysiological study of 
mentally ill patients. Part I: the status of the peripheral autonomic nervous system 
as determined by the reaction to epinephrine and mecholyl. Amer. J. Psychiat., 

106, 18-28. 

N. Stimulus differentiation by schizophrenic and normal subjects. under 

and punishment. J. Pers., 1951, 20, 253-276. 


conditions of reward : 
Gellhorn, E, & Miller, А. D. Methacholine and Noradrenaline tests. Arch. Gen. 


Psychiat., 1961, 4, 371-380. | | | 
Gibby, R. G., Adams, Н. B., & Carrera, R. N. Therapeutic changes in psychiatric 


patients following partial sensory deprivation. Arch. Gen. Psychiat., 1960, 3, 33-42. 
Goldstein, K., & Scheerer, M. Abstract and concrete behavior: an experimental study 


x lo. 9 

with special tests. Psychol. Monogr., 1941, 50, No. 2. | | | 

Greenberg, A. Directed and undirected learning in chronic schizophrenia. Unpublished 
Ph.D. Thesis, Columbia Unive New York, 19 3 u | | 

Gromoll, Н. F. The pro "active dimension of schizophrenia in relation to cortical 

activation апа arousal. Unpublished Ph.D. Thesis, Univer. of Illinois, Urbana, 


1961. 
Hake, Н. H., & Hyman. 
J. exp. Psychol., 1953 
Hamilton, V. Size constancy 
54, 25-39. 
Harris, А. Sensory deprivation н 
Hartman, A. M. The apparent i | 
i renics er. J. Psychot. 
С ee н of behavior. London: < ш Hall, 1949. 
Herrington, R. №., & Claridge, С. 5. Personal communication. 1963. | 
Hoffer, A., & Osmond, H. F. Schizophrenia—an autonomic disease. J. nerv. ment. Dis. 
1955, 122, 448-452. 


Holway, A. H., & Boring. E. G- 
J. Psychol., 1941, 54. 


227. 


H. R. Perception of the statistical structure of binary symbols. 


45, 64-74. 
and сис responsiv 


eness in psychosis. Brit. J. Psychol., 1963. 


and schizophrenia. J. ment. Sci., 1959, 105, 235-236. 
c of after images in delusional and nondelusional 


1962, 75, 587-5 


Determinants of apparent visual size with distance 


; 21-37. 
variant. Amer. 


44 Р. Н. VENABLES 


Hull, C. L. Principles of behavior. New York: Appleton-Century-Crofts, 1943. 

Hunt, J. McV., & Cofer, C. N. Psychological deficit. In J. McV. Hunt (Ed.), Personality 
and the behavior disorders. New York: Ronald Press, 1944, pp. 971-1032. 

Huston, P. E., & Senf, R. Psychopathology of schizophrenia and depression. I: Effect of 
amytal and amphetamine sulphate on level and maintenance of attention. Amer. 
J. Psychiat., 1952, 109, 131-138. 

Huston, P. E., & Singer, M. M. Effect of sodium amytal and amphetamine sulphate on 
mental set in schizophrenia. Arch. Neurol. Psychiat., 1945, 53, 365-369. 

Kaplan, S. D. Autonomic visual regulation. Psychiat. Res. Rep., 1960, 12, 104-114. 

Kinsey, A. C., Pomeroy, W. B., Martin, C. E., and Gebbard, P. H. Sexual behavior 
of the human female. Philadelphia, Pennsylvania: Saunders, 1953. 

Klein, С. S. Need and regulation. In M. R. Jones (Ed.), Nebraska Symposium on 
Motivation. Lincoln: Univer. of Nebraska Press, 1954, pp. 22 

Knopf, I. J., & Fager, R. E. Differences in gradients of stimulus gene 


function of psychiatric disorder. J. abn. soc. Psychol., 1959, 59, 73-76. 
Kohn, H. Effects of variation of intensity of 


lization as a 


experimentally induced stress situations 
upon certain aspects of perception and performance. J. genet. Psychol., 1954, 85, 
289-304. 

Lansing, R. W., Schwartz, E., & Lindsley, D. B. Reaction time and EEG activation, 
J. exp. Psychol., 1959, 58, 1-10. 

Leibowitz, H. W., & Pishkin, V. Perceptual size constancy in schizophrenia. J. consult. 
Psychol., 1961, 25, 196-199. 

Lewinsohn, P. M., & Elwood, D. L. The role of contextual constraint in the learning of 
language samples in schizophrenia. J. nerv. ment. Dis., 1961, 133, 79-81. 

Lhamon, W. T., & Goldstone, S. The time sense. Estimation of one second duration by 
schizophrenic patients. Arch. Neurol. Psychiat., 1956, 76, 625-629 

Lindemann, E. Psychological changes in normal and abnormal dividuals under the 
influence of sodium amytal. Amer. J. Psychiat., 1939, 11, 1083-1091. 

Lindsley, D. B. The reticular formation and perceptual discrimination. In H, Н. 
Jasper, L. D. Proctor, R. S. Knighton, W. C. Noshay, & R. T. Costello (Eds.), 
Reticular formation of the brain. London: Churchill, 1958. 

Lovinger, E. Perceptual contact with reality in schizophrenia. J. abn. soc. Psychol., 
1956, 52, 87-91. 

McGhie, A., & Chapman, J. S. Disorders of attention 
phrenia. Brit. J. med. Psychol., 1961, 34, 103-116. 

Malmo, R. B., Shagass, C., & Smith, A. A. Responsiveness in chroni 
J. Pers., 1951, 19, 359-375. 

Mednick, S. A. Distortions in the gradient of stimulus generalization related to 
cortical brain damage and schizophrenia. J. abn. soc. Psychol., 1955, 51, 536-549. 

Mednick, S. A. A learning theory approach to research in schizophrenia, Psychol. 
Bull., 1958, 55, 316-327. 

Mednick, S. A, & DeVito, R. Associative com 
schizophrenia. Paper read at Eastern Psycholo, 
1958. 

Miller, G. A., & Selfridge, J. Verb; 
J. Psychol, 1950, 63, 176-185. 

Nelson, S., & Caldwell, W. E. Perception of affective stimuli by normal and schizo- 
phrenic subjects in a depth perception task. J. Gen. Psychol., 1962, 67, 323-335. 

Osgood, C. E. Motivational dynamics of language behavior. In. M. R. Jones 
Nebraska symposium on motivation. Lincoln: Univer. of Nebraska Press, 1957. 


and perception in carly schizo- 


c schizophrenia. 


petition and verbal learning in 
gical Assoc. Mecting, Philadelphia, 


al context and recall of meaningful material. Amer. 


INPUT DYSFUNCTION IN SCHIZOPHRENIA 45 


Payne, R. W. Cognitive abnormalities. In H. J. Eysenck (Ed.), Handbook of abnormal 
psychology. New York: Basic Books, 1961. 

Payne, R. W. An object classification test as a measure of overinclusive thinking in 
schizophrenic patients. Brit. J. soc. clin. Psychol., 1962, 1, 213-221. 

Payne, Е. W. & iedlander, D. short battery of simple tests for measuring over- 
inclusive thinking. J. ment. 1962, 108, 362-367. 

Payne, R. W., Matussek, P., & George, E. I. Ап experimental study of schizophrenic 
thought disorder. J. ment. Sci., 1959, 105, 627-652. 

Pearl, D. Language processing ability of process and reactive schizophrenics. J. Psychol., 
1963, 55, 419-425. 

Raush, H. L. Perceptua 


il constancy in schizophrenia. J. Pers., 1952, 21, 176-187. 

Rinkel, M., DeShon, H. J., Hyde, R. W., & Solomon, H. Experimental schizophrenia 
like symptoms. Amer. J. Psychiat., 1952, 108, 5 8. 

Rodnick, E. H., & Shakow, D. Set in the schizophrenic as measured by a composite 

1940, 97, 214-225. 

& Shakow, D. The relationship of some 

zophrenic disorganization. J. Pers., 1960, 


reaction time index. Amer. J. Psychiat., 

Rosenthal, D., Lawlor, W. G., Zahn, T. P., 
aspects of mental set to degrees of schi 
28, 26-38. 


weig, N. A mechanism in schizophrenia. Arch. Neurol. Psychiat., 1955, 74, 


544-555. 

Rosenzweig, N. Sensory deprivation 
similarities. Amer. J. Psychiat., 

Rump, E. E. A note on “Distance constancy 
1961, 107, 48-51. (a) n А . 

Rump, E. E. The relationship between perceived size and perceived distance. Brit. Ј. 
Psychol., 1961, 52, 111-124. (b) 

Salzinger, K. Shift on judgment of weight 
instructions in early schizophrenics anc 
43-49. 

Samuels, I. Reticular mechanisms 


a and schizophrenia: some clinical and theoretical 


1959-1960, 116, 326-329. 
in schizophrenic patients.” J. ment. Sci., 


s as a function of anchoring stimuli and 
1 normals. J. abn. soc. Psychol, 1957, 55, 


and behavior. Psychol. Bull., 1959, 56, 1-25. 
Schwartz, M., & Shagass, C. Reticular modification of somatosensory cortical recovery 


G clin. Neurophysiol., 1963, 15, 265-271. 
indicators of personality and affect. In L. Uhr & J. G. 


ior. New York: Wiley, 1960. 
1 responsiveness in psychiatric patients. Arch, Gen. 


function. 
Shagass, С. Drug thresholds as 
Miller (Eds.), Drugs and behav 
Shagass, C., & Schwartz, M. Cerebra 


Psychiat., 1963, 8, 177-189. (а) 5 
Shagass, C., & Schwartz, M. Psychiatric disorder and deviant cerebral responsiveness to 


n. Recent advances in biol. Psychiat., 1963, 5, 321-330. (b) 
Shakow, D. Segmental sct. Arch. Gen. Psychiat., 1962, 6, 1-17. 

Shakow, D. Psychological deficit in schizophrenia. Behavioral Sci., 1963, 8, 275-305. 
Shannon, С. E. Prediction and entropy of printed English. Bell Syst. Tech. J., 1951, 


sensory stimulatior 


30, 50-64 
о ex J., Cohen, S. L, & Shmavonian, B. M. Investigation of psychophysiologic 
relationships with skin resistance measures. 1. Psychosom. Res., 1959, 4, 65-87. А 
Silverman, J. Psychological deficit reduction in schizophrenia through response-contin- 
gent noxious re-inforcement. Psychol. Rep., 1963, 13, 187-210. | 
Smith, S, Thakurdas, H., & Lawes, T. G. G. Perceptual isolation and schizophrenia. 


J. ment, Sci., 1961, 107, 839-844. | i | 
Snygg, D., & Combs, А. W. Individual behavior. New York: Harper, 1949. 


Spain, B. Personal communication, 1963. 


46 P. Н. VENABLES 


Steriade, M.. & Demetrescu, M. Reticular facilitation of responses to acoustic stimuli. 
EEG clin. Neurophysiol., 1969, 14, 91-36. 

Stevens, J. M., & Derbyshire, A. J. Shifts along the alert-repose continuum during 
remission of catatonic stupor with amobarbital. Psychosom. Med., 1958, 20, 99-107. 

Stevens, S. S. Decibels of light and sound. Physics today, 1955, 8, 12-17. 

Stevenson, J. A. F., Derrick, J. B., Hobbs, С. E., & Metcalfe, E. V. Adrenocortical re- 
sponse and phosphate excretion in schizophrenia. Arch. Neurol. Psychiat., 197 
78, 312-320. 

Stroop, J. R. Studies of interference in serial verbal reactions. J. exp. Psychol. 1935, 
18, 643-661. 

Sutton, S., & Zubin, J. Effect of sequence on reaction time in schizophrenia. Paper 
presented to International Colloquium on Biological Bases of Age Changes in the 
Speed of Behavior. Cambridge, England, 1963. 

Sutton, S., Hakarem, G., Zubin, J- & Portnoy, M. The effect of shift of sensory modality 
on serial reaction time: a comparison of schizophrenics and normals, Amer. J. 
Psychol., 1961, 74, 924-939. 

Teichner, W. H. Recent studies of simple reaction time. Psychol. Bull., 1954, 51, 128- 
149. 

Thouless, R. H. Individual differences in phenomenal regression. Brit. J. Psychol., 
1932, 22, 216-241. 

Topping, G. G., & O'Connor, N. The response of chronic schizophrenics to incentives. 
Brit. J. med. Psychol., 1960, 33, 211-214. 

Venables, P. H. A short scale for rating "activity-withdrawal" in schizophrenics. J. 
ment, Sci., 19: 103, 197-199. 

Venables, P. H. The effect of auditory and vis stimulation on the skin potential 
response of schizophrenics. Brain, 1960, 83, 92. 

Venables, P. H. Selectivity of attention, withdrawal, and cortical activation, Arch, gen. 
Psychiat., 1963, 9, 74-78. (a) 

Venables, P. H. The relationship between level of skin potential and fusion of paired 
light flashes in schizophrenic and normal subjects. J. Psychiat. Res., 1968, 1, 279- 
287. (b) 

Venables, P. H., & O'Connor, N. Reaction times to auditory and visual stimulation in 
schizophrenic and normal subjects. Quart. J. exp. Psychol., 1959, 11, 175-179. 
Venables, P. H., & Tizard, J. Paradoxical effects in the reaction time of schizophrenics. 

J. abn. soc. Psychol., 1956, 53, 220-224. 

Venables, P. H., & Tizard, J. The effect of auditory stimulus intensity on the reaction 
time of schizophrenics. J. ment. Sci, 1 ‚ 104, 1160-1164. 

Venables, P. H., & Wing, J. K. Level of arousal and the subclassification of schizophre- 
nia. Arch. gen. Psychiat., 1962, 7, 114-119. 

Weckowicz, T. E. Size constancy in schizophrenic patients. J. ment. Sci., 1957, 103, 
475-486. 

Weckowicz, T. E. Autonomic activity as measured by the mecholyl test and size con- 
stancy in schizophrenic patients. Psychosom. Med., 1958, 20, 66-71. 

Weckowicz T. E. & Blewett, D. B. Size constancy and abstract thinking in schizo- 
phrenic patients. J. ment. Sci., 1959, 105, 909-934. 

Weckowicz, T. E., Sommer, R., & Hall, R. Distance constancy in schizophrenic patients. 
J. ment. Sci., 1958, 104, 1174-1182. 

Weinstein, A. D., Goldstone. S., & Boardman, W. K. The effect of recent and remote 


frames of reference on temporal judgments of schizophrenic patients. J. abn. soc. 
Psychol., 1958, 57, 241-248. 


INPUT DYSFUNCTION IN SCHIZOPHRENIA 47 


Wells, F. L., & Kelley, C. M. The simple reaction in psychosis. Amer. J. Psychiat., 1922, 
2, 53-59. 
Wertheimer, N., & Wertheimer, M. Capillary structure: its relation to psychiatric 


diagnosis and morphology. J. nerw. ment. Dis., 1955, 122, 14-27. 
Wilder, J. Modern psychophysiology and the law of the initial value. Amer. J. Psycho- 
ther., 1958, 12, 199-221. 
Williams, M. Psychophysiolo; 
schizophrenic reactions. Psychosom. Med., 
Winer, H. R. Incidental learning in schizophrenics. Un 
Univer., Lafayette, Indiana, 1954. 


Zahn, T. P., Rosenthal, D., & Lawlor, W. G. 
chronic schizophrenic and normal subjects. Paper 


carch, Denver, Colorado, 1962. 
D. Effects of irregular preparatory intervals on 
abn. soc. Psychol., 1963, 67, 44-52. 


gical responsiveness to psychological stress їп сапу chronic 
1953, 15, 456-461. 
published Ph.D. Thesis, Purdue 


G.S.R. orienting reactions to visual and 
auditory stimuli in read to 
Society for Psychophysiological Res 

Zahn, T. P., Rosenthal, D., & Shakow, 
reaction time in schizophrenia. J. 


A THEORY OF VERBAL BEHAVIOR 
IN SCHIZOPHRENIA! 


Loren J. Chapman, Jean P. Chapman, and Glenn A. Miller 


SOUTHERN ILLINOIS UNIVERSITY, CARBONDALE, 


ILLINOIS 


I. Theoretical Position 
A. General Principle 
B. Previous Work 
C. Predictions 
II. Experimental Investigations 
A. Method and Subjects 


B. Experiment I. Misinterpretation ii 
Errors of Exclus 


n Contest . x 
ion from Common 


C. Experiment Il. 
foe M em TEE O 
D. Experiment III The Influence of Strong Contextual 
Cues ae 68 
III. Discussion 74 
References 76 


I. Theoretical Position 


А. GENERAL PRINCIPLE 
uas d tM often misinterpret and misuse words in common dis- 
Her = Ленана this observation is commonplace, and despite the fact 
Missam diverse clinical formulations of the phenomenon have gained 
of the xil Ў егу little has been done in the way of nensis exploration 
ikat T of these errors. The present paper reports mr ce experiments 

rected toward testing predictions from one formulation which 


May account for many of these errors. 
MC ba] which will be presented here was stimulated in part by 
disorder ete which indicate a close similarity between schizophrenic 
iure ЫЛЕ. en and normal error tendencies. There is a large litera- 
„ыс Ss RSE the similarity between schizophrenic thought dis- 
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order and error tendencies of normal subjects under one or another of 
various special conditions which induce errors, These special conditions 
include sleep, psychotomimetic drugs, sensory deprivation, fatigue, re- 
laxed attention, emotional excitement, and oxygen deprivation. However, 
the findings of a number of studies have indicated that a special condition 
is unnecessary. In a series of studies designed to measure clinically promi- 
nent features of the disorder, it was repeatedly found that the kinds of 
errors that were hypothesized to be features of schizophrenic disorder 
were also found in normal control subjects, although in reduced number. 
This was true of (1) associative intrusions into schizophrenic conceptual 
performance (Chapman, 1958), (2) the interpret 


ation of common con- 
cepts in an excessively broad manner so 


as to include incorrect items which 
are similar to correct ones (Chapman and Taylor, 1957; Chapman, 1961). 
(5) the confusion of figurative and liter 


al usages of common words and 
phrases (Chapman, 1960), and ( 


4) the tendency to solve formal syllogisms 
by concluding an identity оГ objects which share a common quality 
(Chapman and Chapman, 1959; Gottesman and Chapman, 1960). 

"There are reasons for believing that these findings reflect an accen- 
tuated expression, in schizophrenia, of normal response biases. It is as if 
the response biases are released and expressed more freely. А response 
bias may be defined as a predisposition toward making a particular one 
of the various possible kinds of responses that one might make to а 
given stimulus (Underwood, 1952). It is called a “bias” because psycholo- 
gists often define it by responses which are incorrect or inappropriate 
in the context in which they appear, but it may produce correct answers 
in those situations in which a biased response and the correct response 
coincide. [Campbell (1959) has catalogued a number of these biases.] 

For both normal and schizophrenic persons, response biases act more 
strongly in situations in which other stimuli are not available as cues 
which limit the variety of responses that are appropriate. For example. 
one strong normal response bias is to respond to 
(An associate here means that kind of response su 
association test.) The reader may observe for hi 
associative bias is by trying to think of 
not associatively related to a single stimul 
that one strong associative response 
normal discourse the range of 


a word with its associate. 
ich as is given on a word 
mself just how strong the 
a list of other words that are 
us word—he wil] probably find 
after another comes to mind. In 
appropriate responses to a word is de- 
fined by the context in which the word appears. This context supplies 
a multitude of cues which guide the selection of the overt response to 
the word. This usually tends to elimin associative responses. Neverthe- 
less, associates occasionally intrude appropriate responses in normal 
speech. However, they occur far more freely if the contextu 


ate 
as in 


al cues are 
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reduced. For example, they occur in the word association test itself in 
Which such contextual cues are minimal since the instruction for the 
test specifies that the S is to say the first word that comes to mind when he 
hears a single stimulus word. Associates are also abundant in the free 
association of normal persons, as on the psychoanalytic couch. 

any schizophrenics free-associate much 
of the time in place of more goal-directed discourse. Similarly, Chapman 
(1958) found that normal associates intrude as incorrect responses in 
schizophrenic conceptual performance more than in normal performance. 
The observation that for normal persons the associative bias is expressed 
more freely when constraining contextual cues are reduced suggests the 
Possibility that a reason for the schizophrenic's excessive yielding to this 
normal response bias may be a lesser responsiveness to contextual cues. 
These are the cues which, for normal persons, guide the selection of 
appropriate responses in place of biased ones. For example, Bleuler's 
(1950, p. 26) patient who, in describing a walk with her family began 
With enumerating the members of her family, listed “Father, son" and 
then added, "and the Holy Ghost.” The context of the sentence would 
have indicated to a normal person that the response “and the Holy 
Ghost” was inappropriate even though it would have been an appropriate 
response to “Father, son” in some other contexts. The utterance then 
May be seen as reflecting a lesser responsiveness to the contextual cues. 
ations indicate that many features of the schizophrenic 
accentuated expression of those overt 
responses toward which normal subjects are biased. It seems likely that 
an elaboration of this principle will explain the schizophrenic misinter- 
pretation and misuse of the meanings of words. This elaboration was 
Originally suggested by spontaneous comments that some schizophrenic 
and normal Ss made during testing W 
asked whether two words have the same meaning. One item asked if 
bicycle means the same as wagon. The schizophrenics often said things 
like, “You can ride on a bicycle, and you can ride in a wagon, so they 
mean the same.” Normal Ss would say something like, “You can ride on 
both a bicycle and in a wagon, but a bicycle has two wheels and a wagon 
has four, so they don't really mean the same." These responses suggested 
that schizophrenics do not weigh simultaneously the several different 
aspects of meaning in order to answer appropriately the question at 
hand, but instead answer by using a more limited number of aspects of 
the meaning. Moreover, the aspects of meaning which schizophrenics 
use to excess appear to be those which are more prominent for normal 
persons. The tendency to interpret a word in accordance with its most 
prominent aspect of meaning is a response bias in the same sense that 


It appears clinically that m 


. These consider 
disorder of thought consist of an 


hen presented with items which 


or 
nm 
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the tendency to respond to a word with associates is a response bias. 288 
may be illustrated most readily with a double-meaning word. If a nor- 
mal person is asked for the meaning of the word grain, he is most likely 
to define it in terms of various food plants such as wheat or oats. How- 
ever, if he encounters the word grain in the context of a sentence which 
contains appropriate cues, he will instead interpret it to mean the lines 
or markings in wood. 

As will be seen below (Section I, C) one may infer from schizophre- 
nics' misinterpretation of words a possible explanation of why they fail 
to respond to the contextual cues which, for normal persons, restrain 
the expression of response biases. But first we will present further con- 
siderations and data on this phenomenon of schizophrenic misinterpret 


is somewhat inexact, 
n or written word itself, ОГ 
as a response within the 
ters the word. It is probably clearer, therefore, 
concerning meaning, to speak of 
"aspects of meaning" of a word. A mean- 
to this usage, is a hypothetical internal event 
vioral response to a word. 
€ used the term "meaning response" and 
some of them have offered measures designed to define it operationally. 
Some of these have been concerned with "affective" or "connotative 
meaning as measured by the semantic differential (Osgood, Suci, and 
Tannenbaum, 1957), Е » if one is investigating the meaning 
antic differential would produce measures of how good 


strong, and how quick or slow the concept dog 18 
felt to be. Other investigators haye been concerned with association as 
а measure of meaning (Bousfield, Whitmarsh, and Danick, 1958; Юееѕе, 
1962). Associative measures of me 


s À aning might indicate that the aspects 
of meaning of dog include “саг” 


These two approaches may yield valuable information about im- 
portant varieties of “meaning,” and the theory of this paper yields pre- 
dictions which might be tested in terms of them, However, the present 
paper is primarily concerned wi ariety of "meaning" which 
is usually called “denotative meaning,” ative meaning is difficult 
to define, but it is what the layman usually means by 
a word. It is the representative of the referent of the w 
or events to which it refers. When the referent 
tive meaning is in large part a description of 
the denotative meaning of dog includes * 


ord, the objects 
is an object, the denota- 
the object. 


For example, 
‘is an animal,” 


“has four legs,” 


or 
оо 
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etc. Meaning responses are measured by obtaining statements of mean- 
ing from judges. The judges are instructed to "tell what the thing 
named is, or what it is like," or to list "aspects of meaning" for the word. 
Evidence concerning relative strength of meaning responses is obtained 
either from the order in which the statements of meaning are listed, or 
by asking judges to rank order them. In this presentation, we are limit- 
ing the term “meaning responses" to internal events which represent 
the referents of a word, and "statements of meaning" to the overt re- 


sponses that judges give to questions concerning the aspects of meaning 


of a word. 

Stated in terms of meaning responses, the theory states that schizo- 
phrenics’ misinterpretations of the meanings of words arise in part from 
mediation of overt responses to words by their strongest meaning re- 
sponses with a relative neglect of their weaker meaning responses, while 
the interpretation of words by normal persons reflects the use of the 
Weaker as well as the stronger meaning responses. It is assumed that 
Meaning responses themselves are similar for schizophrenics and normals, 
Thus predictions of group tendencies in schizophrenic thinking are made 
using the average strength of the various statements of meaning offered 
by a group of normal judges. Variation among schizophrenics in their 
thinking disorder is predicted using variation in the statements of mean- 


ing by normal judges. 


B. Previous WORK 

Chapman and Chapman (1964) found support for this theory from 
à study of the characteristics of words which schizophrenics to an ex- 
Cessive extent regard as synonymous. The tendency of schizophrenics to 
regard some words of only moderate similarity as synonymous is a fairly 
striking feature of schizophrenic thought disorder, and it is manifested 
in their tendency to substitute words for one another in ways that normal 
persons do not. These words often have а similarity that normal persons 
recognize, but do not regard as sufficient to define the words as synony- 
mous. The present theory states that schizophrenics tend to interpret 
words in accordance with the strongest meaning responses, neglecting 
weaker meaning responses. Therefore it predicts that the schizophrenic 
tendency to treat words of similar meaning as synonymous will occur 
more often on pairs for which the same meaning response is strongest 
for both of the two words. For example, 31 of 52 normal judges listed 
“18 an animal" as the strongest statement of meaning for both pig and 
for dog. It follows from the theory that schizophrenics, more than normal 
subjects, should regard pig and dog as synonymous. 

A contrasting word pair is brassiere-tee shirt. The five statements 
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of meaning for brassiere, listed in order of rated strength, were “supports 
breasts,” “is an undergarment,” “clothing,” “used by women, ma 
“made of cloth.” The five statements of meaning for tee shirt were ds 
clothing," “is an undergarment," "used by fellows (or men)," “made of 
cloth," and “is white.” Although these two words have in common three 
of their five statements of meaning, only 13 of 59 normal judges ranked 
the same statement of meaning as first in strength for both of the о 
words. The prediction from the theory, therefore, was that schizophrenics 
would less often deviate from normal control 55 by calling pairs like 
brassiere and tee shirt synonymous than by calling pairs like pig and 
dog synonymous. 

In order to test this formulation 38 
by the measure of the number 
of meaning in first Position о 
pair. In addition, ratin 
were obtained in separ. 


Pairs of words were dichotomized 
of judges who ranked the same statement 
E strength for both members of a word 
85 of degree of similarity between pairs of words 
ate judgments, It was found that for words of the 
range of similarity represented in the study, the measure of shared high- 
strength meaning responses had little relationship to degree of similarity- 
Similarity may either consist of several low-strength shared meaning 
responses or of a single high-strength meaning response. For example, 
big-dog and brassiere-tee shirt Were close on rated similarity despite 
their disparity on shared high-strength meaning responses, Two 19-item 
sets of word pairs were obtained which differed on whether their shared 
meaning responses were high Strength or low strength. 
The tendency to respond to word Pairs as if they were synonymous 
was measured by items of the following format: 
7. "Pig" means the same as 
A; stocking 
B. dog 


C. neither of the above, 


'The schizophrenics were found to €xceed the normal subjects on the 
difference between the number of pairs of the two sets which were called 
the same (p < 001). The schizophrenics judged as synonymous more 
pairs of high-strength shared meaning response than low-strength pairs 
(p < .03), while the normals showed qualitatively the Opposite pattern. 
These differences were found to be independent of the rated similarity 
between the pairs of words. These fingi; icate that schizophrenics 
are more likely to show an abnorm y for regarding bw woes 
as synonymous if the words have strongest meaning response 
in common. The findings Support the contention that schizophrenics’ 
N excess by the stronger normal mean- 
lect of Weaker Meaning responses. 


overt responses are mediated toa 
ing responses, with a relative neg 
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C. PREDICTIONS 

We have restricted the definition of meaning responses to internal 
events which represent the denotative referents of a word. Meaning re- 
sponses to the specific words which are being interpreted are only one 
variety of mediating response. As mentioned previously other external 
stimuli from the context in which the word appears also have an in- 
fluence on which of the several meaning responses to a word are used 
to mediate the overt response. The most important of these contextual 
cues are other words. (However, they also include such stimuli as vocal 
qualities and gestures.) Presumably, these contextual cues „also act 
through internal mediating responses. As discussed above (Section 1, A), 
it appears that schizophrenics often fail to respond to the contextual 
cues which have a restraining influence on the expression of response 
biases for normal persons. The present findings and theory indicate a 
possible explanation for this. It seems likely that the schizophrenic ex- 
cessive use of the strongest meaning responses also holds for other types 
of mediating responses, such as those aroused by contextual cues, in- 
structions, etc. It was found that schizophrenics tend to mediate their 
responses to a word only by the strongest normal meaning response, 
neglecting the weaker. It would then appear reasonable to assume that 
there is a general propensity to use the strongest normal mediating re- 
sponse with a neglect of the weaker whether the mediating response is 
a meaning response ог is instead another kind of mediating response. 
The mediating response to each such contextual cue which guides the 
selection of overt responses may frequently be weak in comparison to 
the mediating response which evokes the biased overt response. . For 
example, if a person is asked to interpret the meaning of the word rare 
in the statement, “Robert says he likes rare meat,” the word meat pro- 
vides a weak contextual cue which for normal persons indicates that 
the correct meaning is “partially cooked” rather than “uncommon,” 
which is a stronger meaning for rare when contextual cues are absent. 
The theory predicts that it is in cases such as this, in which the contextual 
cues are weak, that schizophrenics fail to use these cues. Instead they 
interpret words in accordance with the meaning responses to the words 
which are strongest when the words are encountered out of context. 
The frequent failure of schizophrenics to use contextual cues would 
then be seen as one manifestation of the tendency to guide behavior by 
the strongest normal mediating response. It follows that in situations in 
which the contextual cues аге stronger schizophrenics will use them 
more often and their performance will approach that of normals. (Ex- 


periment III, below, will provide some evidence on this.) 
Thus restated, the theory postulates that schizophrenics’ misinter- 
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pretations of the meanings of words arise in part from mediation of 
behavior by the strongest normal mediating responses with a relative 
neglect of weaker mediating responses. It should be noted that this 
theory does not attempt to account for the deviant performance of 
schizophrenics on verbal tasks by postulating a qualitative difference in 


the mediating responses. Instead the theory assumes that the mediating 


responses of schizophrenics show a marked similarity to those of normal 
persons. The theory states th 


at much schizophrenic misinterpretation of 
words reflects an excessive reliance on the mediating responses which 
are strongest for normal persons. Strength, as referred to in this paper, 
always applies to the degree of relationship between a stimulus and a 
response, but following the convention of behavior theorists, we will 
occasionally refer to it in terms of strength of cues or strength of re- 
sponses. In all cases, however, it is the strength of the “bond” between 
stimulus and response that is implied. 

The assumption of the similarity of Schizophrenics and normal per 
sons in the content and relative Strength of meaning responses was 
investigated in a pilot study. Statements of meaning to 20 common words 
were obtained from a &roup of 20 chronic schizophrenics and a group 
of 22 normal control subjects of similar age and education, Of these 20 
words, 15 had more than one meaning and 5 had only a single meaning. 
1t was found that for 19 of these 90 Words, the Statement of meaning 
which was most frequently given by the normal group was also the 
statement of meaning most frequently given by the schizophrenics. As а 
further check on the comparability of the two 8roups, each subject was 


given a score, for the 15 multimeaning Words, of the number of first 


position statements of meanings that coincided with the meaning most 


commonly given by the normal group as a Whole. The mean of these 
Scores was 13.5 for the schizophrenics and 18.7 for the normals. The 
difference was not significant (t — .46; P = .64). These results lend 


strong support to the assumption that schizophrenics and normals have 
a similar hierarchy of meaning responses 


sponses. The hypothesized excess in schi; 
mediated by the strongest mediati 
by several alternative formulations 


X g the corresponding in- 
ternal events. It could be explained by a strengthening of the strong 


mediating responses, or by a weakening of the weak mediating responses, 
or by a multiplication of the strength of both Strong and weak mediating 
responses, or by a selective failure to respond to weak mediating re- 


EE GE lective inhibitory +, : А 
sponses, ог by some selective inhibitor Y process, There is no evidence at 
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present for choosing among these alternative explanations. In addition, 
even if one could arrive at such a specific formulation in terms of media- 
ting responses, one could not assume an isomorphic representation of 
this pattern in overt behavior. 

One of the predictions from the theory in terms of data is that 


schizophrenics, much more than normals, will interpret words in accord- 
al mediating responses whether or not 


ance with the strongest norm 
at their errors will be more numer- 


these are appropriate. This means th 
ous when the correct overt response is mediated by a weak mediating 
asured by normal group judgments, and these errors will 
ord in accordance with a strong normal 
ponse is mediated by the 


response, as me 
consist of interpreting the w 
mediating response. When the correct overt res 
strongest normal mediating response, schizophrenic accuracy will ap- 
proach that of normals. Although this differential error propensity will 


be found for both schizophrenics and normals, it will be greater for 
these effects will be greater when normal 


Schizophrenics. Moreover, 
hich is the strongest mediating 


judges show greater agreement as to W 
response. 


II. Experimental Investigations 


A. МЕтнор AND SUBJECTS 

The present paper reports three studies which tested these predic- 
tions. АП three studies used, as а research tool, words which have more 
than one meaning in ordinary usage. For example, a pen means both “а 
writing implement" and “а fenced enclosure." The first of these is by 
far the stronger of the two meanings, as shown by our measures of 
strength of meaning response. While the theory is intended to apply to 
the use of all words by schizophrenics, its implications can be con- 
veniently tested with multimeaning words. 

These three experiments were not performed in the order in which 
they are presented, and for all of them the data, except the measures 
of strength of meaning responses, were gathered prior to the formulation 


of the theory. 


1. Schizophrenic Subjects 

The patients in all three studies were long-term chronic schizophre- 
nics who were not receiving drug therapy. They were all native-born 
White male patients whose last admission had been no more recent than 
6 months, and most of whom had spent many years in the hospital. No 
patient was included who showed, either by interview or by examination 
of the hospital records, indications of brain damage or mental deficiency, 
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or who had received electric shock or insulin shock therapy in the 
previous 3 months, or "tranquilizing" drugs for the previous 4 weeks. 
The patients used in Experiments I and III were from a group of 
patients at Chicago State Hospital who were being maintained without 
drugs for purposes of other research. The patients in Experiment П 
were at Kentucky State Hospital in Danville. 

For several years now a major obstacle to conducting meaningful 
research on schizophrenic thinking has been that many hospitalized 
schizophrenics are kept on maintenance dosages of one or another of 
the phenothiazines, commonly chlorpromazine. Since the phenothiazines 
markedly affect clinical Symptoms, they might also be expected to affect 
performance on any research task which reflects psychopathology. A 
number of studies have tested for drug effects on psychological measures 
and most have found them. Some of these have found the phenothiazine 
improves performance on a given measure, while other studies have 
found that it impairs performance. Chapman and Knowles (1964) have 
shown recently that phenothiazines produce an increase in some kinds 
of cognitive errors, and a decrease in others. They found that the drugs 
exacerbate errors that might be described as unsystematic, careless, Or 
random, but reduce those that reflect a misinterpretation of common 
concepts in an overly broad manner. The latter kind of error is a striking 
feature of the disorder of thought in schizophrenia and is probably 
highly akin to some of the kinds of errors that were studied in the 
present research. It seems clear from these results that patients receiving 
the phenothiazines are an unsuitable group for the study of schizophrenic 
thought disorder. 

It is also undesirable to use those patients that the hospital per- 
sonnel find can be conveniently maintained without drugs without а 
worsening of the symptoms. Chapman (1963) has shown that selection 
sults of research 


being able to obtain for all of tk 
were maintained off drugs regardles: 
source of potential bias in the d 

Another source of possible bi 
was the exclusion from the stud Dine 
willing to complete testing. Although this bias is, of course, unavoidable, 
it was somewhat reduced in the present studies by the use of tests of 


very simple format, and by extended efforts го develop rapport with the 
patients over a period of weeks. 


as that could not be completely avoided 
ly of patients Who were unable 
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2. Normal Subjects 


The normal control Ss in Experiments I and III were City of 
Chicago firemen. Those in Experiment Il were hired from among 
maintenance workers at the University of Kentucky, and from among 
unemployed workers at the Kentucky Employment Service. Although the 
Ss from the Employment Service were currently unemployed, no one 


TABLE I 
ATION, AND STANFORD-BINET VOCABULARY SCORE 


MEAN AND VARIANCE OF AGE, EDUC. 
coups MATCHED ON VOCABULARY 


For TorAL GROUPS AND SUBGR 


Age Education Vocabulary 
N x s2 X 52 X s 
Experiment I 
Total groups 
Normal 27 43.2 36.6 10.6 5.2 21.7 29.2 
Schizophrenic 25 444 47.6 10.7 4.0 a 59.9а 
Experiment II 
Total groups 
Normal 22 40.4 93.3 10.7 9.8 23.2 20.6 
Schizophrenic 49 47.3 67.6 9.8 11.6 16.40 35.30 
Matched subgroups 
Normal 15 39.6 100.0 10.9 11.2 212 
5 512 332 116 164 221 90.8 


Schizophrenic 
Experiment III 
Total groups 
$8.7 58.4 11.0 2.5 23.9c 29.6c 
5.7 


Normal 25 3 84 
Schizophrenic 27 15.3 16.5 10.0 19.84 58.54 
Matched subgroups 
Normal 20 384 484 11.0 24 22.8 20. 
20 46.4 11.3 9.8 5.6 22.9 21.7 


Schizophrenic 
@ Information on only 21 subjects. = 
> Information on only 44 subjects. 
c Information on only 22 subjects. 
4 Information on only 25 subjects. 


was included who had not held a job for at least 6 months during the 
previous year. A number of them were currently out of work during the 
winter months, but normally worked steadily in the building trades 
ather. The normal control Ss of Experiment 11 were 
paid either $5.00 or $10.00 to participate in a number of experiments 
including the present ones. The patients for whom they were controls 
Were not paid cash but were given a package of cigarettes or a package 
of chewing tobacco for each test taken. There were no such payments 


in Experiments I and 1. 


during warmer we 
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Table I shows the mean age, education, and vocabulary score for 
the schizophrenic and control groups for each of the three studies. 


B. Experiment I. MISINTERPRETATION IN CoN 


sp 

Experiment I was used to test predictions concerning the misinter- 
pretation of double-meaning words Which are presented in context. 

There are many common English words which have more than one 
meaning in everyday usage. For example, the word rare may mean either 
"uncommon" or "partially cooked" and à person who receives a com- 
munication which includes the word rare must decide which meaning 
is intended on the basis of cues obtained from the context in which the 
word appears. These cues may be strong or weak or—to state it in terms 
of mediating rTesponses—they may evoke Strong or weak mediating re- 
sponses. As will be seen below, the present study was concerned with 
words presented in sentences which contain only weak cues, 

When a multimeaning word is encountered alone, i.e., outside the 
context of other words, the various meanings differ in salience, i.e, the 
meaning responses which convey the various meanings differ in strength. 

If schizophrenics have an inadequate response to weak cues pro- 
vided by the context in which a multimeaning w 
pretation of multimeaning words 
should show an excessive relia 1 
strongest when the words ar 
the correct interpretation of a word in 
relatively weak meaning response, 
the word, and their misinterpret 


ord appears, their inter- 


ntext. Hence, when 
a sentence is consistent with а 
Schizophrenics should misinterpret 
ation should be опе mediated by а 
kind of error should exceed the oppo- 
Inappropriate interpretation mediated by a 
weaker meaning response when a stronger one is appropriate. 

Method. Items were constructed around two common meanings of 
multimeaning words in the following format, 


21. When the farmer bought a herd of cattle, he needed a new ре 
This means: pen. 


A. He needed a new writing implement, 
B. He needed a new fenced enclosure, 
C. He needed a new pick-up truck, 


Here the correct answer is intended to be alternative “В” 
e ‹ f ox . 2 

the answer in alternative “A 15 mediated p 3 

sponse to pen. This item was paralleled þ 


although 
У the stronger meaning re 


y the following item. 
40. The professor loaned his pen to Barbara 


This means: 
А. He loaned her a pick-up truck, 
B. Hc loaned her a writing implement, 
C. He loaned her a fenced enclosure, 
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Here the correct answer was intended to be alternative "B," and 
alternative "C" contains a possible misinterpretation mediated by the 
weaker meaning response. It should be noted in item 21 above that the 
cues from the context are related somewhat weakly to the meaning 
responses to pen. For example, stronger cues would be given for the 
weaker meaning of pen by the statement, “The farmer put his cattle in 
the pen." All of the items of the present study were constructed so as to 
make the cue strength as weak as possible but still of sufficient strength 
to indicate the correct choice to normal persons. 

It is seen that in addition to the alternatives representing the correct 
and the incorrect meanings of the word there is a third alternative in 
each item which is unrelated to the word pen. This alternative, called 
here “the unidentified error alternative," was included as a measure of 
"random" responses, i.e., some Ss might mark either of the other two 
alternatives for reasons other than their understanding of the statement 
that they are asked to interpret. For example, schizophrenics might be 
uncooperative or have difficulty taking the test and so might mark 
randomly. Such random marking should result as often in the marking 
of the unidentified error alternative as in marking of the other two al- 
ternatives. Therefore, on items measuring misinterpretations mediated 
by the stronger meaning responses, the number of unidentified error al- 
ternatives for each 5 was subtracted from the number of markings of the 
alternatives representing the stronger meaning responses, to obtain a 
corrected measure of the predicted error tendency. A similar correction 
was used with items measuring misinterpretations mediated by the 
weaker meaning responses. These corrected scores were used in all analy- 
ses of the data. 

At test of 38 items was constructed in the format of the above items. 
It consisted of one item for each of two meanings of 19 double- or multi- 
meaning words. "Table II lists the 19 words together with phrases which 
indicate the two meanings represented in the alternatives. 

Evidence concerning the relative strength of the two meanings of 
each multimeaning word was obtained by presenting a list of the words 
to a group of normal judges (39 students in introductory psychology), 
with the instruction: "What is the first aspect of meaning you think of 
for each of the following words, what is the second, and what is the 
third?" The resultant statements of meaning were weighted in the fol- 
lowing manner. When a statement of meaning was given first, it re- 
ceived a weight of “3”; when it was given second, a "2"; when given third, 
a “1”; and when not given at all, zero weight. The weights for each 
statement of meaning of a word were averaged across Ss, which yielded 
an index of relative strength by which the two meanings in the test 


could be compared. 
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TABLE II 
Tue Worps Usep IN EXPERIMENT I, THE 


MEAN WEIGHTED SCORES FOR THE Two 
MEANINGS, AND PERCENTAGE OF THE 


Јорсеѕ Wuo Аскер ох Wuicn 
15 STRONGER 


Weighted Percentage agreement оп 


scorea which is stronger 

Rare 
l. Partially cooked 1.15 76 
30. Uncommon 2.64 

Tip 
2. Private information 2 73 
34. Money 1.59 

Diamonds 
3. Red spots on Playing cards 03 97 
36. Precious stones 2.82 

Cross 
5. A religious symbol 1.49 89 
28. An X mark 28 

Yard 
6. A grassy place 2.44 69 
29. Three feet 1.54 

Board 
7. Meals 69 89 
32. Flat piece of wood 2.62 

Corn 
8. A little lump on someone's foot 43 100 
35. Plants in a field 3.00 

Pen 
21. A fenced enclosure 42 97 
40. Writing implement 2.90 

Palms 
22. Part of a person's hands 2.03 50 
39. Plants or trees 2.23 

Crack 
23. Hole or Crevice in a wall 246 92 
45. A sarcastic remark 49 

Pit 
24. Hole in the ground 29 92 
44. Hard stone of a fruit 67 

"Toast 
27. A drink in honor of someone 1.85 70 
43. Heated and browned bread 254 

Bats 
ll. Flying animals 1.54 64 
33. Wooden sticks 2.28 

Deck 
14. Part of ship 2.92 


92 
38. A pack of playing cards 
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TABLE II (Continued) 
Weighted ^ Percentage agreement on 


scoret which is stronger 

Date 

15. Appointment to take a girl out 1.72 92 

37. A piece of fruit from a palm tree 
Racket 

16. Noise 1.26 52 

25. Dishonest business 1.03 
Bark 

17. Sound made by a dog = 53 


42. Outer covering of a tree 


Bank 
19. A financial establishment 87 


4l. Side of a river 1.56 
Fall 
20. A season of the year 1.05 90 
2.82 


47. To trip and hurt oneself 


* Higher score means stronger meaning. 


Table 11 shows the mean strength of the two meanings represented 
by the alternatives in the test items, together with the percentage of 
judges who concurred in the group decision as to which is strongest. As 
seen there, the judges showed high agreement for some words, and lower 


for some others. 

The prediction wa 
would make misinterpreta 
ing response when the wea 
context of the sentence п 
would also expect schizophre 
mediated by a meaning respo 


s that schizophrenics, more often than normal Ss, 
tions mediated by the stronger normal mean- 
ker meaning response was appropriate in the 
a which the word appeared. However, one 
nics sometimes to make misinterpretations 
nse which according to the group norms 
is weaker than the correct one. This should occur because, as seen in 
"Table II, there is not complete uniformity among people as to which is 
the stronger meaning response. However, misinterpretations in this di- 
rection should be fewer than those in the predicted direction, and they 
should occur more often on the words for which the judges show lower 
agreement as to which is the stronger meaning. 

Results. The means of the corrected. scores for misinterpretations 
mediated by the stronger and by the weaker meaning responses are 
shown in Table III. These data were analyzed using ¢ tests. Double-tailed 
probability values are reported. The schizophrenics exceeded the normal 
Ss in the wrong interpretations mediated by stronger meaning responses 
(z = 5.18; p <.001) and in wrong interpretations mediated by weaker 


meaning responses (z — 246; p< .02). However, on a score of the dif- 
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ference between the Scores for the two kinds of misinterpretation the 
schizophrenics again exceeded the normal Ss (z = 4.99. p «.001), owing 
to the schizophrenics’ differentially greater reliance on the stronger 
meaning responses. 

TABLE III 


ERPRETATIONS MEDIATED ny STRONGER AND 
WEAKER MEANING RESPONSES 


MEAN NUMBER ОЕ Міѕіҳт, 


Total groups 


Schizophrenic Normal 


Stronger 
Weaker 
Strong minus Weak 


Superficially, the finding that the schizophrenics exceeded the normal 
Ss in misinterpretations mediated by weaker meaning responses might 
appear contradictory to the theory. This finding is nevertheless consistent 
with the theory if the items 


which contributed to the difference were 
those on which the judges were Jess unanimous as to which is the 
stronger meaning. The logic is that if a meaning response which is 
weaker by the group judgmental norms is nevertheless the stronger of 
the two meaning responses to this word for a Minority of the judges, 
it is also stronger for a minority of the schizophrenics. To test this, the 
19 items for which the weaker meaning response mediated the correct 
answer were divided into two sets, 10 for wh 


5 ich the judges had relatively 
high agreement (89% to 100% i 


] ) On which was Stronger, and 9 for which 
the judges had relatively low agree 


idenc d, OW agreement (50% to 87%). (The items in 
these sets may be identified in Table IT), A, seen in Table IV, the dif- 


TABLE Iv 
MEAN NuMBER or MISINTERPRETATIONs MEDIATED BY үү 


10 Irems witu Ниси AGREEMENT ON RELATIVE STRE 
WITH Low AGREEMENT: 


Low Agreement 
High Agreement 


ference between the normal and schizophrenic Ss on misinterpretations 
i d 
mediated by weaker meaning responses was almo 


veake : Ў St completely accounted 
for by the nine items on which the judges tended to disagree. The 
difference between groups was significant (22295 


jid р 193 001) for these 
nine items. For the remaining 10 items оп Which the ; ) 


A THEORY ОЕ VERBAL BEHAVIOR IN SCHIZOPHRENIA 65 


One might wonder whether the finding that schizophrenic misinter- 
pretations are mediated more by stronger meaning responses than by 
weaker ones might merely reflect a lower intellectual functioning of the 
schizophrenics. Disturbed chronic schizophrenics show a lowered ability 
in handling any complex task—even a vocabulary test like the present 
one. The items were such that when the wrong interpretations were 
mediated by stronger meaning responses, the correct interpretations were 
mediated by weaker ones. It may require greater intellectual ability to 
answer when it requires mediation by a weaker 
Meaning response than when it requires a stronger one. In order to check 
On this possibility, it seemed most suitable to use another vocabulary 
test since the experimental instrument was itself a vocabulary test. 

It was possible to obtain Stanford-Binet vocabulary scores for all of 
the normal Ss and for 21 of the 25 schizophrenics (the remaining four 
patients being untestable on this instrument). It is most appropriate to 
test for the relationship in the normal group alone, since the lower 
vocabulary score of the schizophrenics may reflect inability to take the 
test in addition to lowered knowledge of words. For normal Ss the score, 
for the entire test, of the difference between the two kinds of misinter- 
pretation had only a slight nonsignificant relationship to Stanford-Binet 
vocabulary score (r = —.13) in the direction of a high difference score 
tending to accompany а low vocabulary score. Further evidence against 
the importance of vocabulary skill in accounting for the results is that 
the mean Stanford-Binet vocabulary score for the 21 schizophrenics 
was, as shown in Table I, only slightly lower than that of the normal 
Ss, and the difference did not approach significance (z = .75; p = .46). 
Nevertheless, in order to make certain that the difference between nor- 
mal and schizophrenic groups on the difference score was independent 
of vocabulary skill, an analysis of covariance using both normal and 
schizophrenic Ss was computed in which the effects of Stanford—Binet 
vocabulary were partialed out. The schizophrenic and normal groups 
were again found to differ in the predicted direction on the difference 
score (F = 22.59; df = 145; p <.001). 

It seems clear, therefore, that we are justified in concluding not 
only that schizophrenics’ excessive misinterpretations of double-meaning 
are more often mediated by the stronger than by the weaker 
at in addition this error propensity cannot be 


distinguish the correct 


words 
Meaning responses, but th 
attributed to lowered vocabulary skill. 


C. EXPERIMENT П. ERRORS OF EXCLUSION FROM COMMON CONCEPTS 


While schizophrenic thought disorder has many striking features, 
perhaps the most widely discussed feature of all is the difficulty in 
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assigning objects to common conceptual classes. The better known dis- 
cussions of this difficulty include those by Cameron (1939), Goldstein 
and Scheerer (1941), Hanfmann and Kasanin (1942), McGaughran and 
Moran (1956), and Vigotsky (1934). It n 


nay be that some errors of schizo- 
phrenics in the assigning of objects to 


common conceptual classes arise 
al meaning responses either to the 
name of the conceptual classes or to the names of the objects. The 


present study investigates only the first of these, errors which reflect 
excessive reliance on a strong me 


neglect of weaker ones. 


The conceptual class names used were selected from the many 
common conceptual classes having more than one meaning. For example, 
the word head may mean either the head of an animate thing, such ue 
of a man or a horse, or the head of an inanimate thing, such as of a nail, 
a match, or a hammer, However, the animate meaning is the stronger of 
ponses to head. It was hypothesized that when 
asked to sort the names of objects into such double-meaning conceptual 

» Schizophrenics more than. normals would interpret the concepts 
solely in accordance with the stronger normal meaning response to each 
class name and would exclude examples mediated by the weaker mean- 
ing response. 

Method. The S Was required to sort 
with the name of Опе object on it, into tw 
in a certain named conceptual class 

Two boxes with slots in the t 
guide card naming the conceptu 
box. 


aning response to a class name, with a 


a series of cards, each marked 
© piles—those which belonged 
» and those which did not. 

Ор were placed in front of the S. A 
al class was attached to the top of each 


For Task A, the guide card on one box said “Things that have a 
head” and on the other box, “Things that do not have a head." The 
sorting cards were of three types, names of living things that have a head 


(such as dog and horse), names of inanimate objects that have a head 
(such as nail and pin), and irrelevant items. 


a head” were: 
animate: dog, mouse, horse, pig, 
inanimate: pin, Spear, match, nail, 
irrelevant: sheet, blanket, napkin, 
mat, dish rag. 


cat, sheep, cow, goat. 
spike, hammer, screw, arrow. 
tablecloth, quilt, towel, bath 


Task B: The response cards for the concept of “Things that have 
legs” were: 
animate: cow, mouse, rat, horse, goat, c 


inanimate: chair, bed, piano stool 
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irrelevant: pear, banana, peach, apple, orange, grape, pineapple, 
lemon. 
Task C: The response cards for the concept of “Things that have 


teeth" were: 
animate: dog, horse, cow, lion, man, alligator. 
inanimate: zippers, saw, rake, comb, bear trap, barbers' clippers. 
irrelevant: hickory, oak, pine, birch, cedar, maple. 
Task D: The response cards for the concept of “Things that have 


skin” were: 
goat, elephant, horse, pig, dog, cow, sheep, cat, man, rat. 


animate: 

inanimate: apricot, prune, tomato, potato, apple, cherry, banana, 
peach, pear, plum. 

irrelevant: sled, television set, baseball bat, golf clubs, playing 


cards, ice skates, radio, checkers, marbles, bowling ball. 


Evidence concerning differences in strength of meaning responses 
to each of these four class names was obtained in the same manner as 
described in Experiment I. Twenty students in an introductory psychol- 
Ogy course were used as judges. Each judge was asked to give aspects of 
meaning for head, legs, teeth, and skin. The judges almost uniformly 
interpreted all four concepts primarily in terms of the animate ex- 
amples, although the other meanings were sometimes given. AM 920 


as their first response a meaning indicating an interpreta- 


Students gave 
8 head, for legs, and for teeth, 


tion in terms of the animate meaning for 
while 17 of the 20 did so for skin. 

The prediction was that schizophrenics, more than normals, would 
tend to respond to the class names on the experimental instrument in 
terms of the animate meaning only, and therefore, more than normals 
Would exclude names of inanimate examples from the conceptual 


classes. 


However, random placements would be expected to inflate the num- 


ber of exclusions of both types by the schizophrenics: дек texample, 
Schizophrenics would be expected more often than normals to sort care- 
lessly, or to have difficulty reading the cards, or to sort according to box 
Position. Such errors should result in excluding both animate and in- 
animate representatives from the named conceptual class even, though 
such “random” errors do not truly reflect the patients’ interpretation of the 
conceptual class. The number of “irrelevant” cards placed in the named 
conceptual class was used as a measure of such random error in the same 
manner as the number of markings of “unidentified error alternatives” 
was used in Experiment I as a correction for random marking. Corrected 
scores of animate exclusions and inanimate exclusions were computed 
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by subtracting the number of the incorrect placements of the irrelevant 

cards from the number of both the animate and inanimate exclusions. 
Results. Between-group comparisons were made using a # test. 

Probability values corresponding to double-tailed tests are reported. 
The mean corrected scores of animate and inanimate exclusions are 


shown in Table V. The results were very much as hypothesized. As seen 


TABLE V 
EXCLUSIONS or ANIMATE 
REPRESENTATIVES FROM THE CONCE 


MEAN CORRECTED SCORE ОЕ AND INANIMATE 


UAL CLASSES 


Schizophrenic Normal 
Animate 16 19 
Inanimate 1L61 3.81 


in Table V, the schizophrenics exceeded the normal $$ on the number of 
exclusions of the inanimate representatives from the named class (z = 
3.07; p < -01) but were about €qual to the normal subjects on exclusions 
of the animate members. The score of the difference between the num- 
ons distinguished the two groups (z = 


nat the schizophrenics tended more than 
the normal Ss to narrow their interpretation of the conceptual classes 
to that one interpretation mediated by the strongest normal meaning 
response, 


As a check on whether these results might reflect lowered vocabulary 
ability in schizophrenia 


» the difference Score was correlated with Stan- 
ford-Binet vocabulary score for the normal Ss. The correlation was .03, 
i s and the schizophrenic error 
wledge of vocabulary. Using 
score, the schizophrenics were again 
s difference Score (z = 2.89; р < .01). 


D. Experiment III. THE INFLUENCE OF STRONG CoxrExTUAL CUES 


Since in the experiments presented above 
showed some failure of medi g responses, one 
absolute loss, in at least some of 
produce responses mediated by some of the 
weaker meaning responses regardless of the contextual cues, According 

е case. The theory states that the deficit in 
performance arises from an excessive reliance on the strongest mediating 
responses regardless of whether the mediating responses are aroused by 
the word itself or by a contextual cue. Therefore, it follows that if the 
contextual cues that indicate the appropriateness of the weaker meaning 
response are extremely strong, and contextual cues for using the strongest 
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meaning response are absent, the schizophrenics’ use of the weaker mean- 
ing response will approach that of the normals. If, contrary to the 
theory, schizophrenia is accompanied by an absolute loss of some weaker 
meaning responses, then schizophrenics should show a deficit in their use 
regardless of the context. Experiment Ill provides a comparison of 
schizophrenic and normal performance on ability to use weaker meaning 


responses under these conditions. 
Method. The task used was a multiple-choice vocabulary test for 


double-meaning words. The task required 5 to select the correct meaning 


from among four alternatives, of which the first two alternatives in- 


cluded both a correct and an incorrect answer, the third stated "neither 
of the above," and the fourth, "I don't know." There were two items for 


each multimeaning word, one for each of two meanings. For example, 


the word bear was presented in two items. 
28. The word bear may mean: 
А. to carry 
B. to command 
C. neither of the above 
D. I don't know. 
64. The word bear may mean: 
A. a sharp end 


B. an animal 
C. neither of the above 


D. I don't know. 

The meanings corresponding to the stronger and weaker meaning 
responses and their relative strength were identified by the same proce- 
dure as used in Experiment J. This measure indicated that the meaning 
of bear corresponding to the stronger meaning response is that of “ani- 
mal.” If some schizophrenics have an absolute loss of some of the weaker 
meaning responses, the schizophrenic group should show lower accu- 
racy than normal Ss on items calling for weaker meaning responses to 
words than on items calling for stronger ones. However, the presentation 
of the statement of the weaker meaning as one of the alternatives fur- 
nishes an extremely strong contextual cue which indicates that the cor- 
rect response is mediated by that weaker meaning response. The choice 
of the alternative corresponding to the weaker meaning is actually 
mediated by a very strong mediating response in this case. Therefore, 
the theory predicts that schizophrenics would use this strong mediating 
response to give the correct answer corresponding to the weaker meaning. 
rument consisted of 130 items of which 74 


The experimental inst 
ere constructed from 37 double- or multi- 


(the experimental items) w 
meaning words, i.e., 2 items per multimeaning word. There were also 


46 items constructed for single-meaning words and 10 filler items for 
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TABLE VI 
THe Worps Usep ix EXPERIMENT III AND THE MEAN WEIGHTED SCORES 


FOR THE Two Mrax 


Weighted scores 
Barrel 


l. Mctal tube on a gun 


36 
31. Large container 2.85 
Ruler 
2. Measuring stick 2.28 
32. A person who governs 1.59 
Trunk 
6. Part of an elephant 56 
42. The main stem of a tree 59 
Deal 
7. A bargain 2.56 
44. To pass out playing cards 1.62 
Log 
9. Piece of wood 2.87 
43. The daily record of a ship's voyage 67 
Deck 
70. A pack of playing cards 92 
90. The floor of a ship 2.92 
Date 
88. A sweet fruit from a kind of palm trec 51 
126. Appointment with a girl 172 
Spring 
26. A season of the year 2.15 
52. То leap or jump 69 
Run 
27. То move the legs quickly 2.92 
53. To be a candidate for election 05 
Bear 
28. To carry 36 
64. An animal 2.72 
Board 
38. A flat piece of wood 2.62 
63. Meals provided for pay `69 
House 
4l. A building in which People live 2.92 
74. An assembly for making laws 105 
Note 
46. A very short letter 249 
73. A musical sign .38 
Jar 


48. A container, as of glass Sis 
81. To shake or rattle 
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TABLE VI (Continued) 


Weighted score? 
Seal 
51. A kind of sea animal with flippers ый 
77. A design stamped on a piece of paper 28 
Plate | 
57. A dish 
85. The home base in baseball 5 
Mars . Ре 
60. A planet | 
88. A god of war : 
Point M 
61. A sharp end es 
87. To aim * 
= 2.59 
66. A long slender piece of wood z 
98. An end of a magnet 
Sta 
j 2.92 
67. A heavenly body — p 
97. An outstanding movie actor 
"d 2.82 
71. The highest point or part = 
194. А toy that spins on а point 
Bolt n 
78. То run away a 
193. A strong pin of metal or wood x 
Hail Реа 
79. A shout of welcome e 
129. Frozen rain 
Count 
1.03 
82. A European nobleman E 
122. To add up 4 
Sign 
84. To write one's name i 
195. A billboard А 
Club 
89. A heavy stick pe 
110. A social group 9. 
lron 
92. A metal ы 1.90 
108. To press clothing 164 
Bowl : 
93. A hollow rounded dish | 218 
109. To play 2 game with a ball and pins 123 
Box 
ntainer 237 
ede t with the fists A 


To figh 


- 
N 
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TABLE VI (Continued) 


99. A growth on the head of cattle and deer 


-36 

115. A kind of musical instrument 2.82 
Fire 

100. A flame 3.00 

114. To discharge from a job 15 
Lock 

101. A portion of hair -08 

117. A means of fastening doors 2.90 
Saw 

102. Looked at .82 

119. A tool for cutting 246 
Fine 

104. A sum of moncy paid as a punishment 1.82 

127. Excellent 2.1 
Light 

105. Not heavy 26 

128. To set fire to 43 
Racket 

ПІ. A loud noise 1.26 

121, A dishonest way of making money 1.03 
Grain 


112. Seed of pl 


ants, like wheat 
120. Little ji 


and ma 


"BS in wood 


which a correct 
The latter tw, 


alternative Was not listed 


were included 
and to reduce th 


among the four alternatives: 
to prevent Ss from discove! 
€ set for never marking the 
rds used in the experimental 
in the alternatives, and the 


o kinds of items 
ing the nature of the task 
last two alternatives, Table VI jj 


items, together with the meanings offered 
mean strength of the two meanings, 
Results. For this task, se of Experiments 1 and II and 
the earlier study reported in Sect; е score measuring differential 
use of stronger and weaker meanings Was related to vocabulary skill in 
normal Ss. The correlation of stan, Binet Vocabulary score with the 


Sts the 37 wo 


weaker and stronger Normal $$ Was —.48 (p < .05), 
which means that the brighter s; showed less difference in their error 
scores on the two kinds of Items. As seen in Table I, the schizophrenics 


: € normal Ss on Stanford-Binet vocabulary- 
It seemed inappropriate, therefore, to use the entire normal and schizo- 


phrenic groups in testing for the experimenta] effect since the predicted 


M 
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difference might be found simply as a reflection of the lowered vocabu- 
lary skill of the patients. For this reason, the experimental effects were 
evaluated using subgroups of 20 normal and 20 schizophrenic Ss closely 
matched on Stanford-Binet vocabulary score, as shown in Table I. 

Table VII shows the median number of errors for these matched 
g for answers mediated by stronger and 
weaker meaning responses. As seen there, the scores were quite similar 
for the two groups, although qualitatively the pattern of scores was 
slightly in the direction opposite to that which would result from a 
Schizophrenic loss of weaker meanings. The difference between the 
groups on the number of weaker meanings failed minus the number of 
stronger meanings failed did not approach significance (z — .18; p — .95) 
as determined by a double-tailed Mann-Whitney test. 


TABLE УП 
; FOR WHICH STRONGER AND WEAKER 
‘CORRECT ANSWERS 


subgroups on the items callin 


MepIan Error Scores ON It 
MEANING RESPONSES YIELD C 


Schizophrenic Normal 
Stronger 79 70 
2.07 2:21 


Weaker 

These results are in striking contrast to those obtained in Experi- 
ments I and II. It appeared possible that this difference in results could 
be a consequence of differences in the stimulus words instead of the 
hypothesized differences in strength of contextual cues. For example, if 
the weaker meanings of the words in Experiment 111 were not quite as 
Weak as those in Experiment I, the schizophrenics might show less dif- 
ference between the two kinds of items in Experiment III for that rea- 
son alone. This possibility can be evaluated by examining the mean 
Strength of the stronger and weaker meaning responses as listed in Tables 
ll and VI. In Experiment I, the mean strengths of the stronger and 
Weaker meanings were 241 and 1.01, respectively; in Experiment 111, 
2.33 and .77, respectively. Thus, in Experiment III, the weaker meanings 
are qualitatively weaker (.77 as compared with 1.01) than in Experiment 
I and the difference in the stronger and weaker meanings is greater for 
Experiment 111 (1.56 compared to 1.40). These differences in relative 
Strength are slightly in the direction opposite to that which might ac- 
count for the differences in the results. Therefore, it seems most likely 
that the differences in the results obtained with the two tasks are attrib- 
utable to the differences in the strength of the contextual cues. 

These findings are inconsistent with the interpretation that schizo- 
phrenics have lost access to weaker meaning responses. However, the 
findings do support the contention of the present theory that schizo- 
phrenics’ overt responses, more than those of normal persons, are medi- 
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ated by the strongest mediating responses, regardless of whether they 
are aroused by the stimulus words or by contextual cues. 


IH. Discussion 


Although the theory emerged from other research (Chapman and 
Chapman, 1964), the primary data of these three studies were gathered 
prior to the formulation of the theory, and only the measures of mean- 
ing responses were gathered with the theory in mind. Therefore, it was 
not possible to design the experiments so as to maximize the hypothe- 
sized effects. It would have been easier to support the theory using only 
those double-meaning words for which there is a sharp differentiation 
between the strengths of strong and weak meaning responses. Also, 
ideally, the same stimulus words should have been used in Experiments 
I and Ш. However, the Support of the theory by the data is perhaps 
more convincing since they were not gathered in a manner designed to 
support it. 

The theory generates predictions, enumerated below, concerning the 
conditions under which a wide range of well-known features of schizo- 
phrenic disorder of thought will occur and will not occur. Insofar as 
these predictions should be confirmed in future research, we will be 
able to conclude that the theory subsumes these various features of 
thought disorder under a single explanatory principle. 

In the studies reported in the present paper it was found that the 
theory predicts the ways in which schizophrenics misinterpret double- 
meaning words presented in context. This kind of misinterpretation is 
very similar to the confusion of usages of words and phrases which have 
both a literal and figurative (or metaphoric) usage. This kind of con- 
fusion has recently received extensive attention fri 
Bateson, Jackson, Haley, and Weakland (1956) 
metaphoric and literal usages as an import 


om psychopathologists. 
view the confusion of 


ant feature of schizophrenic 
thinking. The present theory would predict that schizophrenics will 


make such misinterpretations under the same conditions as they mis- 
interpret other double-meaning words, i.e., they will choose the inter- 
pretation mediated by the strongest normal mediating response regard- 
less of whether it is figurative or literal, 

Several investigators have been interested in the observation that 
schizophrenics seem to have a special difficulty understanding humor. 
A great many jokes are dependent on double Meanings of words, and 
seeing the point of the joke requires mediation of overt response by both 
strong and weak meanings of the word. The theory, therefore, yields 
clear predictions concerning what kinds of jokes schizophrenics should 
have special difficulty in understanding. 
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The theory also generates predictions which may resolve a seeming 
contradiction between clinical observations and experimental evidence 
concerning errors in syllogistic reasoning. Both von Domarus (1944) 
and Arieti (1955) have reported that an especially striking feature of 
schizophrenic disorder of thought is a tendency to conclude, in syllogistic 
reasoning, that two objects are identical if they share a common quality. 
However, Gottesman and Chapman (1960) found no differences between 
schizophrenics and normals on a measure of such errors. The present 
theory would predict that the von Domarus error is stronger in schizo- 
phrenics than in normal subjects only when the quality shared by the 
two objects corresponds to a strong mediating response to each of them. 
This was not the case for the syllogisms used in the Gottesman and 
Chapman study. If further research indicates that the уоп Domarus 
phenomena occur only when the shared quality corresponds to a high- 
Strength meaning response, it will appear that such. phenomena are 
better accounted for by the present theory. 

The theory also predicts that schizophrenics should show abnormally 
Breat semantic generalization or abnormally great transfer in a verbal 
learning task across some kinds of words and not others. The heightened 
semantic generalization or transfer should occur across pairs of words 
which share meaning responses of highest strength but which also have 
other meaning responses which are not shared. For pairs of words which 
do not fit this specification, schizophrenics should not show excessive 


semantic generalization or transfer effects. 
accounts for some of the behaviors, such as schizo- 


phrenics show on conceptual sorting tests, which are called “concrete” 
thinking by various writers, e.g. Goldstein and Scheerer (1941). For 
example, on one task of the object-sorting test the subject is instructed 
to sort objects together which “belong together,” and on another task 
he is presented objects already grouped together and is asked, “Why 
do these objects belong together?” The theory predicts that schizo- 
phrenics' sorting on the first task and explanations of belonging- 
ness in the second will be mediated to an abnormal degree by the 
Strongest meaning responses to the objects. This means that the schizo- 
phrenics’ grouping of objects and the principle of sorting that they offer 
in response to a question concerning belongingness should show medi- 
ation by normal meaning response of highest strength regardless of 
whether the response would be labeled “abstract” or “concrete.” For 
example, the theory predicts that schizophrenics will show an excessive 
use of sensory qualities only for sets of objects to which the strongest 
normal mediating response is that sensory quality. 

The data of Experiment 11 showed that schizophrenics’ interpreta- 


The theory also 
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tion of the conceptual class name in accordance with its strongest mean- 
ing response may produce errors of overexclusion of objects from the 
conceptual class. The theory may also account for errors which appear 
superficially to be an opposite kind of error, that is, errors of overinclu- 
sion, and it again predicts the kinds of items on which such errors will 
occur. Schizophrenics should exceed normal persons in the inappropriate 
inclusion of names of objects in a conceptual class when the strongest 
normal meaning response is shared by the incorrect objects and objects 
that properly belong in the conceptual class. 

Another advantage of the present theory is that it might be used 
to make predictions concerning individual differences among schizophre- 
nics as to the items on which they make various errors. This could be 
done by obtaining meaning responses from the patients themselves for 
the words or objects which are used on the tests. 

In comparison to other theories of disordered thought in schizophre- 
nia, the present theory is both broad in the range of its predictions and 
precise in the nature of its predictions. It is precise in that it predicts a 
certain kind of error for each situation, and predicts the kind of stimuli 
with which that kind of error will appear and will not appear. A major 
virtue of the theory is that its predictions are readily amenable to ex- 
perimental test. 

Even if all of these predictions should be confirmed, the present 
writers would not wish to claim that the theory accounts for every 
deviant verbal production of schizophrenics, It may be that other prin- 
ciples, as well as the present one, are necessary to produce an adequate 
descriptive system. Also, while the theory and data presented here have 
been in terms of meaning responses held in common by both schizo- 
phrenics and normals, it is possible that in addition Schizophrenia pro- 
duces some increase of deviant meaning responses. Also, this theory does 
not go beyond description to attempt an explanation of the etiology of 
the disorder. Nevertheless, the usual first step in understanding a dis- 
order is to describe it, and the chief attraction of the present theory is 
the promise that it holds for reducing the number of principles neces- 
sary for that description. 
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The purpose of this paper is to invite consideration of the experi- 
menter himself as a source of variance in the results of psychological 
experiments. As different as experiments may be from one another with 
respect to purposes, procedures, and subject sampling, they all have in 
common that someone must collect the data—the experimenter or his 
surrogate. To the extent that the experimenter himself may be a 
significant determinant of the results of his research, it may become 
necessary to re-evaluate carefully the results of experiments completed 


and the design of experiments proposed. 


1 Most of the studies summarized were supported by research grar 
P nts (G-176: 
G-24826, GS-177) from the National Science Foundation. ш ( 85, 
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We may distinguish two broad types of effects which experimenters 
may have upon their results: (1) those which operate without directly 
affecting the subjects of the experiment and (2) those which lead to sub- 
jects changing the nature of their responses. 


I. Effects Not Altering Subjects’ Responses 
A. OBSERVER EFFECTS 


In one way or another the experimenter must observe the subjects’ 
responses. By observer “effects” or “error” we mean over- or underestima- 
tion of some criterion value. When two observers disagree in an observa- 
tion, each may be said to err with respect to the other. Both may be said 
to err with respect to some third observation which may, for various 
reasons, be a more or less usefully employed criterion. Given a popula- 
tion of observations, we may choose to define their mean value as the 
“true” value and regard all observations not falling at the mean as being 
more or less in error as a direct function of their distance from the mean. 

Observer errors or effects may be distinguished from observer “bias” 
by the fact that observer errors are randomly distributed around a “true” 
or “criterion” value. Biased observations tend to be consistently too high 
or too low and may bear some relation to some characteristics of the 
observer (Roe, 1961), the observation situation (Pearson, 1902), or both. 

In considering the act or sequence of acts constituting the observa- 
tion in the scientific enterprise, we may distinguish conceptually among 
locations of error or bias. The error of “apprehending” occurs when some 
sort of misrecording occurs between the event observed and the observer 
of the event. We may include here such diverse sources of apprehending 
error as differing angles of observation (George, 1938), imperfections in 
the sensory apparatus, relay systems, cortical projection areas, and the 
like. The error of recording may be conceptually distinguished from the 
apprehending error. We assume an errorless act of apprehending but a 
transcription of the event to paper, to the ear of another observer, or to 
another instrument which differs from the eve 
hended. In actual practice, of course, when an event or observation is 
recorded in error with respect to some criterion, we cannot locate the 
error as having occurred in apprehending, in transcribing, or in both 
processes. There is no way we can isolate an apprehending error uncon- 
founded with a recording error. Computational errors are more clearly 
distinguishable from the foregoing errors since they involve the incorrect 
manipulation of recorded events, Incorrectness is usually defined here by 
the formal rules of arithmetic operations. 

The magnitude and extent of observer effects 


nt as correctly appre- 


are well known in the 
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history of science (Rosenthal, 1963a). In our own discipline, Boring 
(1950) has made us particularly aware of observer variance by his discus- 
sion of the personal equation of the astronomers. Most contemporary 
discussions of observer effects, however, seem to have become more per- 
functory. Reference is made to modern instrumentation which may serve 
to eliminate observer error. That observer effect may be reduced by 
mechanical means seems reasonable enough. That instrumentation may 
not eliminate observer effect must also be considered. If the instrument 
be a dial, it must be read by a human observer. If the instrument be a 
computer, the printout must likewise be read by an observer. Observer 
effect or variability in the reading of scales was well noted by Yule (1927). 
A general error tendency which he found was the inclination to read 
scales to quarters of intervals rather than to tenths. Empirical analysis of 
his own observer effect revealed to Yule his tendency to avoid the number 
7 as a final digit and to favor the numbers 8, 9, 0, and 2. Yule, inci- 
dentally, was relatively optimistic that proper training could eliminate 
observer error. The equally reknowned methodologist R. A. Fisher 
(1936) did not, however, seem to share this optimism. 

There have been few investigations of observer effects in the 
psychological experiment. A notable exception was the work of Sheffield, 
Kaufman, and Rhine (1952). In an experiment in psychokinesis, they 
filmed the actual fall of the dice which subjects were trying to influence. 
They found subjects believing in the phenomenon to make tallying 
errors in favor of the hypothesis. Subjects who disbelieved made the 
opposite type of tallying error. | 

Both recording as well as computational errors of experimenters 
were systematically and explicitly studied for a person perception task 
experiment (Rosenthal, Friedman, Johnson, Fode, Schill, White, and 
Vikan-Kline, in press). In that study, each subject wrote on a pad his 
rating of the degree of success OF failure of persons pictured in photo- 
graphs. The 30 experimenters of this study transcribed these ratings to a 
master data sheet during the course of their interaction with their sub- 
jects. A comparison of experimenters’ transcriptions with subjects’ record- 
ings revealed only 90 errors out of 3000 data transcriptions. All errors 
were made by 12 of the 30 experimenters. Errors were not randomly 
distributed but tended to fall in the direction of the experimenters’ 


hypotheses. 
The computation task for 
to sum the 


the experimenter in the study under 
discussion was simply 90 ratings given him by each of his 5 
subjects. Most experimenters erred computationally and nonrandomly so. 
Not only were more biased experimenters more likely to make computa- 
tional errors in their favor; they also tended to make larger computa- 
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tional errors. In this same experiment, all subjects rated their experi- 
menters on the variable of "honesty" during the conduct of the 
experiment. The mean rating was quite high on honesty and the range 
of ratings quite small. Experimenters could not actually have been 
"dishonest" even if they had been so inclined. During the experiment, 
the co-investigators had all experimenters under surveillance (a fact 
apparent to all experimenters). In spite of the restricted range of the 
ratings of the experimenters’ honesty made by the subjects, these ratings 
predicted far better than chance whether the experimenter would sub- 
sequently favor himself more in the making of a computational error. 


In the study under discussion, we found a significant correlation 
(48, p = .01) between the occurrence of recording errors and the occur- 
rence of computational errors. Thus, experimenters who erred in data 
transcription tended to err in data processing. However, those experi- 
menters who erred in the direction of their hypothesis in their recording 
errors were not any more likely to err in the same direction in their 
computational errors. Numerical errors seemed then to be a consistent 
characteristic while directionality of error vis-a-vis hypothesis did not. 


We should note here that the over-all effects of both recording and 
computational errors on the grand means of the different treatment 
conditions of the experiment were negligible. An occasional experimenter 
did have some real effect on the data he obtained; an effect which could 
be serious if an entire experiment depended on an experimenter who 
was prone to err numerically. Only rarely in the psychological literature, 


however, is attention called to a real or alleged numerical error (Hanley 
and Rokeach, 1956; Wolins, 1962). 


B. INTENTIONAL Errrcrs 


Intentional error production on the part of the experimenter is 
probably as rare an event in the psychological experiment as it is in the 
sciences generally (Wilson, 1959; Shapiro, 1959; Turner, 1961). Never- 
theless, any serious attempt at understanding the social psychology of 
Psychological research must consider the occurrence, nature, and control 
of this type of experimenter effect. 


In survey research the “cheater problem” is of sufficient importance 
to have occasioned a panel discussion of the problem in the International 
Journal of Attitude and Opinion Research (1947). Such workers as 
Blankenship, Connelly, Reed, Platten, and Trescott seem to agree that, 
though statistically infrequent, the cheating interviewer can affect the 
results of survey research, especially if the dishonest interviewer is 
responsible for a large segment of the data collected. Other workers 
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calling attention to the cheating problem in survey research include 
Crespi (1945-1946), Mahalanobis (1946), and Cahalan, Tamulonis, and 
1947). One way in which the problem of intentional error in 
differs from the problem in the sciences generally is in 
the status, career aspirations, and identifications of the data collectors. 
While most data in the sciences are gathered by professional scientists 
and their professional assistants, the interviewers in survey research are 
often part-time nonprofessionals with less education and less aspiration, 
if any, to identification with a scientific career role. 

A similar situation exists in the laboratory exercises performed as 
part of undergraduate courses in experimental psychology. In such 
laboratory exercises the fabrication of data is commonplace and well 
known to instructors. Positive sanction is sometimes involved for the 
fabrication of "real-appearing" data. The same sort of fabrication occurs 
in undergraduate physics laboratory courses. Students in these under- 
graduate laboratory courses, like interviewers in survey research, do not 

generally have the same commitment to "science," however, that pro- 

fessional scientists or research assistants, scientists in the making, have. 

In two experiments carried out by undergraduate laboratory students, 

attempts were made to check the incidence of dishonesty. In both situa- 

tions, the experiments performed were structured not as just "laboratory 

exercises" but rather as sources of “real data" to be used in the publica- 

tions of “real scientists." In one of these experiments involving the learn- 

ing of rats in Skinner boxes, several instances of data fabrication came to 

light (Rosenthal and Lawson, 1963). In the other experiment, through 
the use of participant observers, а somewhat more precise estimate of 
intentional error production was possible (Rosenthal and Fode, 1963a). 
In that study of discrimination learning in Tats, no instances of actual 
data fabrication were noted, though it is remotely possible that they 
might have occurred. Rather, error production involved the use of 
prodding of animals on a number of occasions. 'These "errors" were 


randomly enough distributed over treatment conditions that the obtained 
means, and more importantly, the differences between means, were 


probably not seriously affected by their occurrence. . | 
Somewhat closer to the situation of the professional, science-oriented, 


data collector is the situation described by Azrin, Holz, Ulrich, and 
Goldiamond (1961). They replicated a verbal conditioning study origi- 
nally conducted by Verplanck (1955) and obtained results similar to his. 
In both studies, experimenters had been able to control the content of 
their subjects’ verbal productions by verbal reinforcement. It developed, 
however, that the formal procedure for experimenters was such that, if 
followed, the experimental effects could not be demonstrated! A number 


Verner ( 
survey research 
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of the experimenters, it was learned, had resorted to the fabrication of 
the data they believed the investigators to want. 


The Control of Intentional Errors 


In discussing the control of intentional errors committed by survey 
interviewers, Crespi (1945-1946) stated that these errors were more a 
problem of morale than of morals. After discussing various ways of 
assessing the data for the likelihood of cheating, he returns to the theme 
that most of all, we need to know why data collectors cheat. Removal of 
the conditions motivating intentional error production seems to be the 
most valuable control of their occurrence. No method of post hoc analysis 
(International Journal of Attitude and Opinion Research, 1947) of 
whether cheating occurred can ever be a satisfactory substitute for 
eliminating the need to cheat. 

Perhaps one long-range improvement in d 
would be the revision of the current practice in un 
courses of telling students what data they 
reward on data honestly, if unskillfull 
"correct" data, skillfully but dishonestl 
be something unresearchm 
at the end of a journey 


ata-collector attitudes 
dergraduate laboratory 
will obtain. Better put a high 
y, obtained than to reward the 
у manufactured. There seems to 
an-like in telling students what they will find 


whose purpose is really to confront a surprise. 


Il. Effects Altering Subjects’ Responses 


А. EXPERIMENTER ATTRIBUTES 


х, race, religion, status, likeability, and warmth all 
have been shown to affect subjects’ 

subjects, adjustment, hostility, anxiety (Winkel and Sarason, in press), 
acquiescence, authoritarianism, and intelligence are other experimenter 
variables found to make a difference (Rosenthal, 1963b). A concrete 
example of a recent experiment investigating the effect of experimenter 
attributes follows. 


The study employed 40 experimenters (almost all male) and 230 
subjects, about half of them male and half of them female (Rosenthal, 


Persinger, Vikan-Kline, and Mulry, 1963e). Each experimenter requested 
about six subjects to rate the apparent s 


pictured in photographs. Thirty-one of th, 


i р t and the Taylor Manifest 
Anxiety Scale before running their subjects. A correlation of .48 (p — .02) 
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was obtained between the experimenters’ anxiety scores and the degree 
of success their subjects saw in the photos of faces. For these same 
experimenters, the correlation between their need for social approval as 
measured by the Marlowe-Crowne Scale and the ratings of success of the 
photos obtained from their subjects was —.32 (p = .10). Thus, subjects 
rated persons as more successful when in the presence of experimenters 
who were more anxious and had a lower need for social approval. (For 
this sample, the correlation between experimenters’ need for social 
approval and their anxiety was —.14.) 

The results reported seemed clear enough and we were quite un- 
prepared for the results of a subsequent experiment (Rosenthal, Kohn, 
Marks, and Carota, 1963a). With 26 male experimenters running a 
total of 115 female Ss in the same photo-rating task we obtained 
diametrically opposed results. This time less anxious experimenters 
obtained ratings of greater success from their subjects (32 = 7.55, p < 01). 
Furthermore, experimenters higher in need for social approval obtained 
more ratings of success (y? = 3.85, р = .05). Apparently then, personality 
variables may interact with replications in partially determining the sub- 
jects’ responses. The most obvious difference between the two experiments 
reported was in their locale. The first study had been conducted at a 
medium-sized state university in the midwest. The second had been con- 
ducted at an east coast private university. Exactly how these differences 
might have served to lead to opposite results remains a question for further 


study. It does seem, however, that where an experiment is conducted can 


make a significant difference in its outcome—even when the principal 


investigator is held constant. 

ODELING EFFECTS 

a special case of the effects of experimenter 
btained from subjects. The specific attribute 
involved here is the quality of the experimenter's own performance at 
the task he subsequently assigns his subjects. The extent to which a 
subject's task performance is predictable from his experimenter's own 
performance of the same task is the extent of the modeling effect. 

While studies of modeling effects have been conducted for some time 
in the field and in the clinic, analogous studies in the laboratory have 
been undertaken only very recently. This may be attributed to a general 
belief that the greater "rough-and-tumble" of the field and clinic might 
naturally lead to increased modeling and related effects. Conversely, the 
greater degree of "control" and "precision" of the laboratory may have 
led to the view that these extrancous effects are less likely to occur. 

The assumptions underlying the development of a program of 


B. EXPERIMENTER M 


Modeling effects are 
attributes on the data o 
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research on laboratory experimenters’ effects were that (1) there is no 
discontinuity of control, precision, or operating processes between field 
studies, clinical studies, and laboratory studies that are a simple function 
of the site of the research, and (2) that there is no discontinuity between 
the processes governing the interpersonal experimenter-subject relation- 


ship in any sort of social research and the processes governing any other 
sort of interpersonal relationship. 


A total of eight experiments hay. 
designed to assess the existence and magnitude of experimenter modeling 
effects. All of these studies employed the same task. Subjects were asked 
to rate a series of 10 or 20 photographs on a rating scale of how success- 
ful or unsuccessful the persons pictured appeared to be. In all eight 
studies, experimenters themselves rated the photos before running their 
subjects. This was accomplished as part of the training procedure, it be- 
ing most convenient to train experimenters by having them assume the 
role of subject while the authors acted in the role of experimenter. For 
each study, modeling effects were defined by the correlation between the 
mean rating of the photos by the different experimenters themselves and 
the mean photo rating obtained by each experimenter from all his sub- 
jects. The number of experimenters (and therefore the N per correlation 
coefficient) per study ranged from 10 to 96. The number of subjects per 


study ranged from 55 to 906. The number of subjects per experimenter 
ranged from 4 to 20, the mean falling between 5 and 6. In all, 1 

menters and over 800 subjects were included (Rosenthal, 1963c). 
All experimenters were either graduate Students or advanced 
undergraduate students in Psychology or guidance. All subjects were 
drawn from elementary college courses, primarily in Psychology but also 
and the humanities, All of 


е been conducted which were 


45 experi- 


had at least two treatment conditions 
be partially transcended by 


Table I shows the correlation obtained in each of th 


€ eight studies 
ers and data later ob 


is пе correlations expressing modeling 
effects, the number of experimenters on which the correlation is based is 
shown for each experiment. We see 


a remarkable inconsiste 
correlations, the range of rhos being fi 


vidually, only the correlation of .65 (р 
zero at even the .10 level, two-tailed. ( 
only one based on data not avail 


ncy of obtained 
rom —.49 to 4.65, Taken indi- 
= .001) differed significantly 
This Particular correlati 
able for closer study by t 


from 
on was the 
he Writer.) 
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However, the hypothesis that this array of correlations could constitute 
à set of chance fluctuations from some "true" value of rhu had to be 
rejected (y? = 20.8, df = 7, p = .005). The same analysis omitting data 
from experiment 2 gave a у? value of 9.80, df = 6, р = .15. It seems more 
likely than not that in different experiments utilizing a person perception 
task there will be significantly different magnitudes of modeling effects, 
which for any single experiment might often be regarded as a chance 


fluctuation from a correlation of zero. 


TABLE I 
Моркыхс EFFECTS BY SERIAL ORDER OF EXPERIMENT 
Experiment N Rho 
1 10 52 
2 24 65 
3 12 18 
4 18 31 
5 18 —.07 
6 26 —.32 
F 12 —.49 
8 25 14 


Table I shows the obtained correlations in the temporal order in 
Which the experiments were performed. There was a tendency for later 
Studies to show significantly more negative correlations than the earlier 
Studies, The correlation (rho) between the algebraic magnitude of cor- 
relation and order of study was .81, one-tailed, p = .03. (Omitting experi- 
ment 2, rho was .75, two-tailed, p = .05.) It therefore appears that in 
later studies the modeling effect significantly decreased algebraically. 
This finding is difficult to explain. From the earlier to the later studies 
there seemed to be no systematic change in the character of the experi- 
menters, subjects, or treatment conditions. Almost a guess is the sugges- 
tion that in later studies experimenters were somewhat more likely to 
Suspect that they themselves were the objects of study and that this 
recognition may have tended to reverse the direction of the effect. 
Experimenters, perhaps, were on their guard to avoid any possible sort 
OL effect and modeling effects may have been so assiduously guarded 
Against as to reverse their effects. Evidence for this “reversal of experi- 


Menter effects” will be summarized later. 


C. EXPERIMENTER EXPECTANCY EFFECTS 
All experimenters have some orientation toward the results of their 
research. Rarely is this orientation one of dispassionate disinterest. 


Variables are not chosen for inclusion in research by using tables of 
random numbers. They are, rather, chosen because the experimenter 
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has certain expectations about the relationship or lack of relationship 
between the selected variables and certain other variables. A superficial 
exception to this might be seen in so-called heuristic hunts for relation- 
ships, which are perhaps more common to the behavioral sciences. Even 
here, however, the inclusion of variables is not on a random basis, and 
certain relationships appear more likely to be found than others. 

We now raise the question of whether an experimenter's expecta- 
tions can affect the data actually obtained in his research. We are not so 
much concerned here with the problem of choice of experimental design 
and the fact that certain designs may unintentionally be more or less 
favorable to obtaining expected or unexpected data. Neither are we 
concerned with the problem of statistical tests of hypotheses and the 
fact that uniquely most powerful statistics may unintentionally be 


employed when the expectation is to be able to reject the null hypothesis, 


while less powerful statistics may be employed when the expectation is 
to be unable to reject the null h 


ypothesis. These are interesting questions 
but will not be considered here. Our usage of "results" or “outcome” will 
be restricted to the raw data obtained by experimenters from tlieir sub- 
jects. What we are talking about then is something very much like 
Merton's (1948) concept of “self-fulfilling prophecy." 
event (ie. an experimental result) 
changes the prophet's (experimenter's) 
the predicted event more likely. 
Evidence for the operation of these expectancy effects has been 
drawn from everyday life, from the field of education, from clinical 
practice, and from the field of survey research (Rosenthal, in press a). 
Until recently, however, there has been little experimental evidence that 
self-fulfilling prophesies may operate even in a carefully controlled 
laboratory research setting. Perhaps the earliest laboratory experiment 
which involved the systematic manipulation of 
expectancies was that by Stanton and Baker 
geometric figures were presented to 


One prophesies an 
and the expectation of it then 
behavior in such a way as to make 


the experimenters’ 
(1942). Twelve nonsense 


è a group of 200 undergraduate sub- 
jects. After several days, retention of these figures was measured by five 


experienced workers. The experimenters were supplied with a key of 
"correct" responses, some of which were incorrect, Experimenters were 
explicitly warned to guard against any bias associated with their having 
the keys before them and therefore influencing their subjects to guess 
correctly. Results showed that the experimenter obtained outcomes in 
accordance with his expectations. When the item on the key was correct, 
the subject's response was more likely to be correct than when the key 


was incorrect. In a careful replication of this study Lindzey (1951) 


emphasized to his experimenters the importance of keeping the keys out 
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of the subjects’ view. This study failed to confirm the Stanton and Baker 
findings. Another replication by Friedman (1942) also failed to obtain the 
significance levels obtained in the original. Still, significant results of this 
Sort, even occurring only in one out of three experiments, cannot be 
dismissed lightly. Stanton (1942) himself presented further evidence 
Which strengthened his conclusions. He employed a set of nonsense 
materials, 10 of which had been presented to subjects, and 10 of which 
had not been. Experimenters were divided into three groups. One group 
Was correctly informed as to which 10 materials had been exposed, 
another group was incorrectly informed, while the third group was told 
nothing. The results of this study also indicated that the materials which 
the experimenters expected to be more often chosen were, in fact, more 
often chosen. 

But the most fascinating and most instructive case of experimenter 
expectancy effects is no doubt that of Clever Hans (Pfungst, 1911). Hans, 
it will be remembered, was the horse of Mr. von Osten, a German 
mathematics teacher. By means of tapping his foot, Hans was able to 
add, subtract, multiply, and divide both integers and fractions. Hans 
could spell, read, and solve problems of musical harmony. To be sure, 
there were other clever animals at the time. There was "Rosa" the mare 
of Berlin, who performed similar feats in vaudeville, and there was the 
dog of Utrecht, and the reading pig of Virginia. AII these other clever 
animals were highly trained performers who were, of course, intentionally 
cued by their trainers. 

Mr. von Osten, however, did not profit from his animal's talent, nor 
did it seem at all likely that he was attempting to perpetrate a fraud. He 
Swore he did not cue the animal and he permitted other people to 
question and test the horse without his own presence. Pfungst and his 
colleague, Stumpf, undertook a program of systematic research to dis- 
cover the secret of Hans’ talents. Among the first discoveries made were 
that if the horse could not see the questioner, Hans was not clever at all. 
Similarly, if the questioner did not himself know the answer to the 


question, Hans could not answer it either. Still, Hans was able to answer 


Pfungst’s questions as long as the investigator was present and visible. 


fungst reasoned that the questioner might in some way be signalling 
to Hans when to begin and when to stop tapping his hoof. A forward 
inclination of the head of the questioner would start Hans tapping, 


Pfungst observed. He tried then to incline his head forward without 


asking a question and discovered that this was sufficient to start Hans’ 


tapping. As the experimenter straightened up, Hans would stop tapping. 
Pfungst then tried to get Hans to stop tapping by using very slight 
upward motions of the head. He found that even the raising of the eye- 
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brows was sufficient to stop the tapping. Even the dilation of the ques- 
tioner's nostrils was a cue for Hans to stop tapping. 

When a questioner bent forward more, the horse would tap faster. 
This added to the reputation of Hans as brilliant. That is, when a large 
number of taps was the correct response, Hans would tap very, very 
rapidly until he approached the region of correctness, and then he began 
to slow down. It was found that questioners typically bent far forward 
when the answer was a long one, gradually straightening up as Hans got 
closer to the correct number. For some experiments, Pfungst discovered 
that auditory cues functioned additively with visual cues. When the 
experimenter was silent, Hans was able to respond correctly 31% of the 
time in picking one of many placards with different words written on 
them, or cloths of different colors. When auditory cues were added, Hans 
responded correctly 56% of the time. 

Pfungst then played the part of Hans, tapping out responses to 
questions with his hand. Of twenty-five questioners, twenty-three un- 
wittingly cued Pfungst as to when to stop tapping to give a correct 
response. None of the questioners (males and females of all ages and 
occupations) knew the intent of the experiment. When errors occurred, 
they were usually only a single tap from being correct. The subjects of 
this study, including one experienced psychologist, were unable to dis- 
cover that they were emitting cues. 

Pfungst summarized the difficulties in uncovering the nature of 
Clever Hans’ talents by speaking of "looking for, in the horse, what 
should have been sought in man." We shall now report the results of a 
continuing program of research on the effects of experimenters’ expect- 
ancies on the data they obtain from their subjects. We have then, in a 
sense, paraphrased and accepted Pfungst's advice. We seek to find in the 
experimenter a portion of what has most often been sought in the subject. 


ПІ. Demonstration of Expectancy Effects: Human Subjects 


The basic paradigm for the study of ex 


perimenter expectancy effects 
has been to create tw 


o or more groups of experimenters with different 
expectancies or hypotheses about the data they would Obtain from their 
subjects. Only the very first experiments carried out will 
some detail (Rosenthal and Fode, 1963b). Subsequent experiments will 
simply be summarized. 


be reported in 


Method 


Fifty-seven photographs of faces ranging in size from 2 X 3 cm to 
5x6cm were cut from a weekly news magazine and mounted on 


3x5 inch white cards. These were presented individually to 70 male 
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and 34 female students, enrolled in an introductory psychology class at 
the University of North Dakota. Ss were instructed to rate each photo 
on a rating scale of success or failure. The scale ran from —10, extreme 
failure, to 4-10, extreme success, with intermediate labeled points. Each 
S was seen individually by one of the Es (RR) who read to each the 
instructions given below. 

Instructions to Subjects. "I am going to read you some instructions. 
I am not permitted to say anything which is not in the instructions nor 
can [ answer any questions about this experiment. OK? 

"We are in the process of developing a test of empathy. This test is 
designed to show how well a person is able to put himself into someone 
else's place. I will show you a series of photographs. For each one I want 


you to judge whether the person pictured has been experiencing success 


or failure. To help you make more exact judgments you are to use this 
rom —10 to +10. A rating of 


rating scale. As you can see the scale runs f 

—10 means that you judge the person to have experienced extreme fail- 
ure. A rating of +10 means that you judge the person to have experienced 
extreme success. A rating of —1 means that you judge the person to have 
experienced mild failure while a rating of 4-1 means that you judge the 
person to have experienced mild success. You are to rate each photo as 
accurately as you can. Just tell me the rating you assign to each photo. 
All ready? Here is the first photo. (No further explanation may be given 
although all or part of the instructions may be repeated.)" 

From the original 57 photos we selected 10 for each sex which met 
the following requirements: (a) their mean rating was close to zero 
(between —1 and +1), (b) their distribution of ratings was not signifi- 
cantly skewed, (c) when the mean ratings for each of the ten cards was 
summed, they summed to zero exactly. Thus we obtained a set of 
"neutral" stimulus-value photos which were rated on the average as 


neither successes nor failures. 

Experimenters. Ten of the eleven students 
graduate experimental psychology served in this capacity. All were 
Psychology majors and three of them were first-year graduate students in 
Psychology. All but two of the Es were males. 

Subjects. Ss were 906 students enrolled in a course in introductory 
Psychology (92 males and 114 females). Because Ss were given class credit 
for participating in the experiment, most of the class volunteered, thus 
reducing the selective effect of using "volunteer" Ss (Rosenthal, in 
press b). Each Æ ran from 18 to 24 subjects. 

Procedure. The E's task was structured as a laboratory exercise to see 
whether they could replicate “well-established” experimental findings as 

students in physics labs are expected to do." Es were told to discuss their 


in a class in under- 
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project with no one and to say nothing to their Ss other than what was 
in the Instructions to Subjects. АП Es were paid a dollar an hour except 
that if they did a "good job" they would be paid double; $2.00 an hour. 
АП 10 Es received identical instructions except that five Es were told 
that their Ss would average a +5 rating on the ten neutral photos. The 
other Es were told that their Ss would average a —5 rating. Thus the 
only difference between the two groups of Es was that one group had a 
plus mark written in the front of the "5" while the other group had a 
minus mark written in front of the “5.” As a warm-up for the Es, each 
of them also rated the standardized set of 10 photos. The exact instruc- 
tions to Es were as follows: 

Instructions to Experimenters. "You have been asked to participate 
in a research project developing a test of empathy. You may have seen 
this project written up in the campus newspaper. There is another reason 
for your participation in this project—namely, to give you practice in 
duplicating experimental results. In physics labs, for example, you are 
asked to repeat experiments to see if your findings agree with those 
already well established. You will now be asked to run a series of Ss and 


obtain from each ratings of photographs. The experimental procedure 


has been typed out for you and is self-explanatory. DO NOT DISCUSS 
THIS PROJECT WITH ANYONE until 


your instructor tells you that 
you may. 


"You will be paid at the rate of $1.00 
your results come out properly— 
instead of $1.00. The 
rating. 


per hour for your time. If 
as expected—you will be paid $2.00 
Ss you are running should average about a —5 


“Just read the instructions to the Ss. S 


hello and goodbye. If for any reason you should say anything to an 5 


other than what is written in your instructions, please write down the 
exact words you used and the situation which forced you to say them 
"GOOD LUCK!" 


ay nothing else to them except 


Results 


The results of this experiment are shown in Table II. The difference 
between the mean photo ratings obtained by Es ex 


| : a iby pecting higher ratings 
and those expecting lower ratings was significant at the .007 level (one- 
tailed p, t = 3.20, df = 8). Data obtained by the two female Es, one in 
each treatment condition, did not differ from the mean ratings obtained 
by the male £s of their respective experimental 


e conditions. The grades 
earned by all Es in their experimental PSychology course were not 


related to either the mean photo ratings obtained from subjects, or the 
А 


magnitude of the biasing phenomenon. 
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Because of the striking nature of the results obtained, we decided to 
undertake another demonstrational experiment (Rosenthal and Fode, 


1963b). 


TABLE II 
EXPERIMENTAL RESULTS UNDER Two EXPECTANCIES 
Expectancy 
+5 =} 
+.66 +.18 
+45 + 
+35 +.04 
+.31 —37 
+.25 —42 
Means: 
+.40 —.08 


Method 


Experimenters. Twelve 
advanced undergraduate cours 
assigned to serve as Es. In this samp. 
Most were majoring in engineering sciences. 

Subjects. Ss were 86 students enrolled in a course in introductory 
psychology (50 males and 36 females). These Ss were also given class 
credit for participating in the experiment. Each Е ran from 4 to 14 Ss. 

Procedure. The procedure of this experiment was just as before with 
the exception that Es did not handle the photos. Instead, each set of 10 
photos was mounted on cardboard and labeled so that Ss could call out 
their ratings of each photo to their E. It seemed reasonable that less 
handling of the photos might serve to reduce the effects of Es’ expect- 
ancies on the data obtained from Ss. As before, half the Zs were led to 
expect ratings of +5, and half were led to expect ratings of —5. 


of the 26 male students enrolled in an 
e in industrial psychology were randomly 
le of Es, few were psychology majors; 


Results 

The results of this second demonstration are shown in Table ПІ. 
Once again, all Es expecting higher ratings obtained higher ratings than 
did any E expecting lower ratings (p = .0005, one-tailed, t = 4.99). 

In this replication Ss tended to rate photos as more successful than 
did the Ss of the first experiment. This difference was greater for the 
“+5” groups of Es (p = .004, two-tailed) than for the "—5" groups 
(р = .08, two-tailed). The second experiment also showed a significantly 
greater difference than the first between the means of the two treatment 
conditions (p — .0005, two-tailed, і = 12.25, df = 9). Why the Ss of the 
replication rated photos as more successful and why they were signifi- 
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cantly more biased by their Es is difficult to interpret. Any one or more 
of four fairly obvious reasons may have contributed to these differences: 

l. Different samples of Es were involved, engineering rather than 
psychology majors. 


2. Different samples of Ss were involved; but these should not have 


differed particularly. 


3. Es did not handle the photos in the second study, as they had in 
the first. 


4. Es were instructed by a faculty member in the first experiment 
and by a graduate student in the replication. 


TABLE III 
EXPERIMENTAL RESULTS UNDER Two EXPECTANCIES: REPLICATION 
Expectancy 
+5 —5 
+3.03 +1.00 
+2.76 +0.91 
+2.59 +0.75 
+2.09 +0.46 
+2.06 +0.26 
+1.10 —0.49 
Means: 
+2.27 +0.48 


Conclusions 


What can we conclude from the two experiments described? It seems 
clear from the data that Es’ expectancies or hypotheses can be partial 
determinants of the results of their experiments. Since Es were not per- 
mitted to say anything to their Ss other than the st 
the communication of E’s biases must hav 
linguistic (e.g., tone) or kinesic (e.g. 


andard instructions, 
€ been by some subtle para- 
facial expressions, gestures) signals. 


answer. One factor is the 


| al life by other than Ph.D. 
psychologists. More and more data appear to be collected by 


assistants. This trend will likely continue as 
continue to obtain increasing financial $ 
Looking now at the pattern for 
appears that the better established h 
has greater financial support for his 
more research assistants. For m: 
researchers, therefore, 


research 
Psychological researchers 
: upport for their research. 
a given psychological researcher, it 
e has become, the more likely that he 
research. He is therefore 


any of the more productive 
the actual data collection 


able to hire 
behavioral 
is often primarily 
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carried out by research assistants. Most of these assistants are graduate 
students but there is an increasing number of undergraduate research 
assistants. This trend is properly endorsed and encouraged by various 
government programs including the Undergraduate Research Participa- 
tion Program of the National Science Foundation. It is for these reasons 
that we feel it to be fairly safe to generalize from our student Es to the 
real life data collector. 

The other factor relevant to the problem of the generalizability of 
our findings assumes for the moment that "real" researchers are Ph.D. 
behavioral scientists. Our position then is that generalizing from ad- 
vanced students in undergraduate or graduate psychology programs to 
"real" researchers is somewhat risky business. Risk of this sort, however, 
is a relative matter and we deem the risk less than the risk of generalizing 
from the behavior of college freshmen or sophomores to “human 
behavior" in general, and much less risky than generalizing from infra- 


human to human behavior. 


IV. Demonstration of Expectancy Effects: Animal Subjects 


Pavlov was aware of the fact that experimenters could influence 
their animal subjects. In speaking of experiments on the inheritance of 
acquired characteristics, he suggested that the noted increase in learning 
ability of successive generations of mice was really more an increase in 
teaching ability on the part of the experimenter (Gruenberg, 1929, 
р. 327), 

Two studies in experimenter expectancy effects have been conducted 
using animal subjects (Rosenthal and Fode, 1963a; Rosenthal and 
Lawson, 1963). In the first experiment, a total of 12 experimenters each 
ran five albino rats on a simple discrimination problem daily for a 5- 
day period. On the basis of ratings of how well they thought they would 
like working with rats, six matched pairs were formed. For each pair, one 
Member was assigned to a group of experimenters who were told that the 
Subjects they were running were maze-bright, while the other member 
Was assigned to a group who were told that their subjects were maze-dull. 
Subjects for the two treatment groups were from a homogeneous colony 
and matched for age and sex. 

Results indicated that on 3 of the 5 days and for the experiment as 
4 whole, experimenters believing their subjects to be bright obtained 
performance from them significantly superior to that obtained by experi- 
Menters believing their subjects to be dull (p = .01). The subjects 
believed to be bright appeared to be learning the problem while those 
believed to be dull did not. These results occurred in spite of the fact 
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that on the level of verbal report both groups of experimenters wanted 
their subjects to perform well. In addition, a rescarch assistant following 
the identical experimental procedure was able to obtain without "cheat- 
ing" performance from her animals superior even to that obtained by 
experimenters believing their subjects were bright. Comparing the degree 
of correlation between what each experimenter specifically expected to 
obtain from his animals and what he actually did obtain from them for 
the "bright" and "dull" groups suggested that these groups were about 
equally biased, although, of course, in opposite directions. 

In the second experiment a total of 38 experimenters were divided 
into 14 research teams each of which had one rat assigned to it. Eight of 
the teams were told that the subjects they would be working with had 
been bred for brightness while the remaining six teams were told that 


their subjects had been bred for dullness. АП subjects were drawn from 
the same animal colony, all were female, and 


Animals were assigned at random to one of the tw 
which were experimenters' beliefs or expectations 

Seven subexperiments including ( 
acquisition, (c) extinction and s 
reinforcement, ( 


all were 80 days old. 
© treatment conditions, 
about subjects’ ability. 
a) magazine training, (b) operant 
pontaneous recovery, (d) secondary 
е) stimulus discrimination, (f) stimulus generalization, 
and (g) chaining of responses were performed. Differences in performance 
favored the groups of experimenters believing their subjects to be bright 
in seven out of the the eight comparisons and these differences could not 
often have occurred by chance (p — +02). There was no trend over the 
course of the experiments for the treatment effects to either increase or 
decrease nor were the performances of subjects in any experiment save 


one correlated significantly with their performances in the subsequent 
experiment, It appeared then, that the Several experiments were, to a 
great extent, independent. Comparisons of 


the treatment. effects among 

each of the five laboratory sections to which experimenters had been 

assigned showed no real difference, all sections showing the mean 

differences in the predicted direction and at similar levels of significance. 

These differences were obtained in spite of the fact that laboratory 
instructors gave more help to ex 


- petimenters whose Subjects were per- 
forming poorly and that all exper 


: Я imenters were motivated to have their 
animals perform well in order to complete the sequence of experiments. 


In addition, a laboratory instructor was present in each labor: 
that gross recording errors and differences in experimenters’ 
of their subjects would have been observed and corrected, 

On the basis of questionnaire data obtaine 
it appeared that experimenters believing their sy 
for brightness were more satisfied with their pa 


atory so 
treatment 


d in both experiments, 
bjects to have been bred 
Tücipation in the experi- 
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ments, liked their subjects more, watched them more intently, and found 
them to be more pleasant. They tended also to be more enthusiastic, 
friendly, encouraging, and pleasant; but were less talkative and less loud 
when working with their subject. Perhaps the most crucial difference 
was these experimenters' handling their animals more. The potentially 
facilitating effect of handling on learning seems well established (Bern- 


stein, 1957). 


V. Effects of Excessive Reward on Expectancy Effects 


rimenters can obtain from their human or 
expect to obtain. The studies described 
bjects offered mild incentives for the 
operation of the experimenter expectancy effect. Experimenters were 
offered $2.00 instead of $1.00 “if their data came out as expected." The 
studies which employed animal subjects offered no incentive for the 
operation of experimenter biasing of their experimental findings. On the 
contrary, these experimenters were motivated by the nature of the situa- 
tion to obtain uniformly good performance from their rat subjects. In 
these studies then, it appeared that an experimenter's expectancy was а 
more powerful determinant of the results of his experiment than his 


desires regarding the results. 
In order for any scientist to do any research, he must be motivated 


to do so, and probably more than casually so. It is, after all, a lot of 
trouble planning and executing an experiment. The motivation to 
conduct research is usually related to certain motivations associated with 
the results of the research. Rarely is the investigator truly disinterested 
in the results he obtains from his research. William James put it thus: 
. science would be far less advanced than she is if the passionate 
desires of individuals to get their own faiths confirmed had been kept 
out of the game . . - if you want an absolute duffer in an investigation, 
you must, after all, take the man who has no interest whatever in its 
results: he is the warranted incapable, the positive fool" (1948, p. 102). 
Two experiments were conducted to investigate the effects of “un- 
usual" motivation on the operation of experimenter expectancy effects 
(Rosenthal, Fode and Vikan-Kline, 1960b; Rosenthal, Friedman, Johnson, 
Fode, Schill, White, and Vikan-Kline, in press. In the first of these 
Studies, 12. graduate student experimenters ran à total of 58 under- 
graduate subjects in the photo-rating task described earlier. All experi- 
menters were given an expectancy about the data they would obtain from 
their subjects. Half the experimenters were moderately motivated to ob- 
tain this data by the promise of $2.00 for “obtaining good data.” The 
remaining experimenters were more extremely motivated by the promise 


We have seen that expe 
animal subjects the data they 
which have employed human su 
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of $5.00 for "obtaining good data." Results indicated that the more 
moderately motivated experimenters obtained data more in accord with 
their expectancies than did more extremely motivated experimenters 
(p — .01). There was, in fact, a tendency for the more extremely moti- 


vated experimenters to obtain data opposite to that which they had been 
led to expect. 


In the second study 30 advanced undergraduate experimenters ran 


a total of 150 subjects in the same photo-rating task. Again, all experi- 
menters were given an expectancy about the data they would obtain 
from their subjects. Half the experimenters were “ 
by telling them that if the data they obtained w 
an unknown partner they wi 
their partner's as well. 


excessively" motivated 
as "better" than that of 
ould be paid not only their own $1.00 but 


If, on the other hand, their "partner" did a 
"better job" he would get their reward as well as his own. Results again 


suggested that more moderately motivated experimenters obtained data 
more consistent with their expectancy. Again, more extremely motivated 
experimenters tended to obtain data opposite to what they had been led 
to expect. The results of this second Study were not so statistically sig- 
nificant, but the two studies taken together yielded a p of .05. 

Why might the effect of excessive incentive to bias the subjects' data 
be to reduce or even to reverse the biasing effect of the experimenters? In 
à postexperiment group discussion with the experimenters of the second 


reported study, many of them seemed somewhat upset by the experi- 
mental goings-on. Several of them used the term “ 
that they felt that the inv 
data which was, in a sense, 


payola," suggesting 
estigators were bribing them to get "good" 


true. Since money had been mentioned and 
dispensed to only the more motivated experimenters of this study, it 


seems likely that they were the ones perceiving the situation in this 
way. Kelman (1953) found that subjects under higher motiy 

form to an experimenter showed less such c ап did subjects 
under lower conditions of motivation. One eral of Kelman's inter- 
pretations was that the subjects who were rewarded more may have felt 
more as though they were being bribed to conform for the experimenter's 


own benefit, thus making subjects suspicious and resentful, апа there- 
fore less susceptible to experimenter influence. This interpretation fits 
the present situation quite well. 


ation to con- 
onformity th 


of sev 


VI. Subjects’ View of Biased Experimenters 


Riecken has discussed the social nature of the €xperimenter-subject 
interaction in some detail (1962). An important aspect of this inte 


Е raction 
is the subject's perception of the experimenter. What sort of person is the 
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experimenter seen to be in his relationship to the subject? The answer 


to this question has general relevance to an increased understanding of 
the social psychology of the psychological experiment. More particularly, 
is a more biased experimenter seen as a somewhat different person than 


a less biased experimenter? The answer to this question might increase our 


understanding of the mediation of experimenter expectancy effects. 


Subjects’ Perceptions of Experimenters’ Behavior 
of the experiments involving the photo- 
rating task each of 56 subjects completed a questionnaire describing the 
behavior of his experimenter during the conduct of the experiment. 
Twelve experimenters completed the same forms describing their own 
behavior during the experiment. Neither experimenters nor subjects 
knew beforehand that they would be asked to complete these question- 
naires, and no one save the investigators saw the completed forms (Rosen- 
thal, Fode, Friedman, and Vikan-Kline, 19602). These forms consisted of 
27 20-point rating scales ranging from —10 (eg. extremely discourteous) 
to +10 (e.g., extremely courteous). 
Mean ratings of the experimenters by their subjects and by them- 
selves reflected very favorably on the experimenters. The profile of the 


€Xperimenters as they were viewed by subjects showed remarkable simi- 
imenters as viewed by themselves (rho — 


larity to the profile of the experi 
89). To summarize and facilitate interpretation of the obtained ratings, 
all variables were intercorrelated and cluster analyzed (see Table IV). 


Four clusters emerged for which the associated B coefficients were all 
considerably larger than generally deemed necessary to establish the sig- 
nificance of a cluster. For mnemonic purposes we may label Cluster I as 
"Casual-Pleasant," Cluster II as "Expressive-Friendly," Cluster ПІ as the 
"Kinesic Cluster," and Cluster IV as “Enthusiastic-Professional.” 

ters differ in the degree to which their expectancy is 
predictive of the data they obtain from their subjects. We could therefore 
ask the question of whether those experimenters whose expectancies de- 
termined their results more were perceived by subjects as behaving dif- 
ferently during the course of the experiment. Magnitude of expectancy 
eflect or bias was therefore correlated with all the ratings subjects had 
made of their experimenters’ behavior. 

The median correlations with degree of experimenter bias of the 
variables in Clusters I and IV were .26 and .21, respectively; neither was 
significantly greater than a correlation having a p — .50. The median 
correlations with degree of experimenter bias of the variables in Clusters 
II and III were .47 and .43, respectively. Both of these median correla- 


At the conclusion of one 


Experimen 


100 ROBERT ROSENTHAL 


tions were significantly greater than a correlation to be often expected by 


chance; two-tailed p's were .04 and .01, respectively. More biased experi- 

menters, then, were characterized by higher loadings on the “Expressive- 

Friendly" and the "Kinesic" clusters. These findings suggest that kinesic 
TABLE IV 

CLUSTER ANALYsIS ОЕ SUBJECTS’ PERCEPTION oF Ex 


Cluster I: B = 648 
Honest 


PERIMENTERS 


Cluster П: В = 3.97 


Liking 
Casual Friendly 
Relaxed Personal 
Pleasant Interested 
Courteous Encouraging 
Business-like 


Expressive face 
Expressive-voiced 

Use of hand gestures 
Satisfied with experiment 


Slow-speaking 
Pleasant-voiced 
Behaved consisten tly 


Mean rating = 5.91 


Mean rating = 2.57 


Cluster ПІ: В = 910 Cluster IV: В = 3.55 


Enthusiastic 
Professional 
Quiet. (nontalkative) 


Use of head gestures 
Use of arm gestures 
Use of trunk 

Use of body 

Use of legs 


Mean rating — 


96 Mean rating = 3.01 


and possibly para ing 
his subjects serv 
jects. 


uistic aspects of the experimenter’s interaction with 
€ to communicate the experimenter’s bias to his sub- 


Subjects! Predictions of Experimenters Computational Errors 
АП experimenters 


are rated more so than 


1 : Were rated as significantly 
less honest (p — .02) by their subjects during the course of data collection. 


Т experimenters’ computa- 
enters’ behavior during the 


о h we Presently have no an- 
swer. It seems clear, however, that subjects learn а good deal about their 


experimenter in the brief interaction of the Person-perception experi- 
ment conducted. Furthermore, when a subject rates the behavior of an 
experimenter, we may do well to take his rating Seriously. It may prove 


to be a valuable predictive, postdictive, or paridictive vatisbte fni 
thal et al., in press). 
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УП. Interaction of Subjects' and Experimenters' Expectancies 


described so far, and those to be reported here, 


In the experiments 
manipulated. In 


experimenters’ expectancies have been experimentally 
this section we shall describe the additional effects of the simultaneous 
experimental manipulation of subjects’ expectancies. 

Riecken (1962) has discussed the subjects’ "deutero-problem" in the 
psychological experiment, which exists beyond the problems of simply 
performing the formal task required of him by the experimenter. The 
most favorable solution to the deutero-problem would involve the maxi- 
mal satisfaction of the subjects three aims in the experiment: (1) the 
attainment of some sort of reward, (2) the divination of the experimenter's 
true purposes, and (3) the favorable presentation of self for the experi- 
menter’s scrutiny. Those stimuli in the experimental situation which 
provide cues to the subject as to how he may best solve his deutero- 
problem have been called “demand characteristics” (Orne, 1962). One of 
the major purposes of the two studies to be reported here was to investi- 
gate the role of subtle and of blatant demand characteristics in determin- 
ing the results of psychological research (Rosenthal, Persinger, Vikan- 
Kline, and Fode, 1963d; White, 1962). 

In the first of these studies, 18 experimenters positively reinforced 
a total of 65 subjects for high positive ratings of the success of persons 


pictured on photographs. Half the experimenters were led to expect high 
rates of awareness from their subjects, while the remaining experimenters 
were led to expect low rates of awareness. Each of these two groups was 
further divided into a group of experimenters whose subjects' instructions 
favored their "seeing through" the experimental situation and a group 
whose subjects received standard instructions. An additional group of 
four experimenters running a total of 26 standard-instruction subjects 
Was used as a control group. These experimenters were biased to expect 
high photo ratings but did not reinforce any subject’s response. 

ased to expect higher rates of awareness obtained 
higher rates of awareness (р = .07) and subjects who had been given a 
set to “see through” the experiment tended to be more often aware, 
though this effect was not as reliable as the effect of experimenter's bias. 
The obtained effects were most powerful and most significant statis- 
tically (p = .05) when they were operating conjointly. Subject set was, 
however, significantly related to magnitude of conditioning with sub- 


Jects set to "see through" conditioning less. Sets favoring "seeing through" 
an experiment may carry an implied set to not conform to more super- 


ficial demand characteristics of the expert 
18 experimenters ran à total of 108 sub- 


Experimenters bi 


mental situation. 


In the second experiment, 
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jects in the photo-rating task described earlier. : | 
(ranging from —6 to +6) were induced among both experimenters аш 
subjects. The expectancies created in subjects were far less subtle € 
was the case in the preceding study. Results suggested the operation 9 
contrast effects wherein subjects confronted with experimenters of radi- 
cally different expectancies seemed to make more entrenched responses. 
In addition to this contrast effect and regardless of the direction of the 
expectancies involved, subjects tended to rate the photos as more success- 


ful when their expectancy and that of their experimenter were in greater 
accord. 


Six different expectancies 


These studies served to increase the generality of the findings bear- 
ing on the effects of experimenters’ and subjects’ expectancies. In addi- 
tion, however, they indicated that more subtle demand characteristics 

responses made by subjects. As with 
unusually motivated experimenters, unusually "expectant" or cued sub- 


jects provided data less directly predictable from a knowledge of experi- 
menters’ hypotheses. 


ҮШ. Effects of Experimenter Expectancy: Negative Instances 


The title of this section m 


until the work reported here was accomplished, there was only unequi- 
vocal and highly predictable anc 

question of the effects of experi ’ expectancies on the results of 
their research, Nothing could be further from the truth. The results of 
the first experiment described in the section on th 
reward were at least in part quite unpredic 
made sense but it was, after all, post hoc. Wi 
experiment reported in that 
initially “negative” е only with respect to our 
initial predictions. They made sense after further data collection and 


White's (1962) experimental data, ast section, 
similarly did not conform to our expectation. There Was no neat regres- 
sion line ascending from congruent experimenter-subject expectations 
for negative data to those expectations congruent for Positive data, In 
опе sense, that study also provided a neg. 


ative instance, Моге data are 
needed to make conceptual good sense of 


a hopefully orderly phenome- 
non. As we turn then to a consideration of "negative" instances, we must 


bear in mind that they are neither the first nor the last such instances, 
Nor does their “negative” nature imply that we cannot learn from them 
something useful about the phenomenon under discussion, 


Summarized in the 1 
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In the first of the two studies to be summarized here, eight experi- 
menters administered an intelligence test (Block design of the WAIS) to 


to a total of 32 subjects (Wartenberg-Ekren, 1962). Experimenters were 


to be led to expect that half their subjects would perform well and half 


would perform poorly. 
Results of this study showed no effect of experimenters' expectancies 


on the data obtained from subjects. Wartenberg-Ekren's analysis of ex- 
aires revealed that all experimenters expected the 


perform better. In addition, none of the experi- 
had treated their subjects differentially as a func- 
tion of their perceived group membership. The over-all negative findings 
may have been due in part to the fact that one of the eight experimenters 
intentionally instituted a double blind procedure to avoid bias! He 
simply did not look at the code identifying the treatment condition of his 
next subject. Another experimenter was “blind” for half his subjects, 


although not intentionally so. 
Subjects’ ratings of their experimenters 1n the Wartenberg-Ekren 


Study were not directly comparable to those of our earlier studies because 
she used a three-point scale rather than our 20-point scale. We were able, 
however, to group many of her scales into two of our four earlier derived 
clusters: Cluster I (Casual-Pleasant) and Cluster II (Expressive-Friendly). 
Subjects labeled as brighter did not rate their experimenters any dif- 
ferently on the scales included in Cluster I. These subjects, however, rated 
their experimenters as significantly higher on Cluster 11 variables than 
did subjects labeled as less bright (p = .03). This suggests that although 
all experimenters believed they treated all subjects alike, they made some 
distinction in their treatment of subjects as a function of their perceived 
group membership. We should emphasize that not only were subjects un- 
aware of which group they belonged to, indeed, they were unaware that 
they belonged to алу group- It seems safe to accept their ratings of their 
experimenter as reflecting real differences in experimenters’ treatment of 
subjects. The Cluster II variables included in the Wartenberg-Ekren study, 
and on which “brighter” subjects rated their experiments as higher, 
were friendly, likeable, interested, encouraging, expressive-faced, and use 
of hand gestures. Apparently even the most careful conscious safeguards 
against differential treatment of subjects in different treatment conditions 


may prove ineffective. . "- 
The second study to be summarized in this section 1s one conducted 


by Pflugrath (1962). He had three sets of three graduate student coun- 
selors administer the Taylor Manifest Anxiety Scale to a total of 142 
subjects. One set of experimenters Was led to believe their groups of 
Subjects were very anxious, another set Was led to believe their groups of 


perimenters’ questionn 
brighter" subjects to 
menters felt that they 
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subjects were nonanxious, and the third set w: 
groups of subjects. 


The over-all analysis of the results showed no significant differences 
in anxiety scores earned by the subjects of the three groups of exper 
menters. Among the subjects run under the condition of their experi- 
menter believing them to be highly anxious, a significantly greater 
proportion scored as relatively less anxious than the subjects of the 
control group (p = .09). This finding was unpredicted. Nevertheless, the 
phenomenology of the situation must be considered. Here we have a group 
of counselors-in-training told nothing about their subjects—the control 
group. Another group of counselors is told their subjects are not an- 
xious—so much the better for the subjects! But when counselors are 
told they will be testing very anxious subjects who even have had con- 
tact with the counseling center, might not all their counseling skills be 
subtly brought to bear upon the task of reducing their subjects' anxiety? 
This interpretation seems so plausible, at least on a post hoc basis, that 
we wonder why we did not predict it in the first place. 

The simplest conclusions to dr 
discussed in this section might be tl 
affect subjects’ intelligence test or ar 
may be a large array of tasks whic 
effects of experimenter expect 
resistant is a question deservi 
however, about the particula 


as told nothing about their 


aw from the results of the experiments 
iat experimenters’ expectancies do not 
nxiety scale performance, Indeed, there 
h will prove relatively resistant to the 
ancies. What sorts of tasks are more or less 
ng further research. Some questions remain, 
T two experiments summarized. To what ex- 
tent was the absence of an expectancy effect in the intelligence test study 
due to the intentional and unintentional instituting of a double-blind 


procedure, which if it remains truly blind, must eliminate expectancy 
effects? 


Less in the nature of a question and more 
potential power of expectancy effects, we hav 
that experimenters treated them differentiall 
menters' expectancies, the existence of w 
jects, and this in a context of tremend 
need for objectivity, stand 
subjects. 

Whether therapeutically oriented experimenters 
generally try to reduce the anxiety of those subjects perceived to be 
anxious is a question deserving of further research. Nor only in ad- 
ministering anxiety scales, but in a great deal of contemporary social 
research, subjects are exposed to conditions believed to make them anx- 
ious. What might be the effect on the outcome of experiments of this 
sort of the covert therapeutic zeal and/or skill of various investigators 


as an illustration of the 
e the subjects’ word for it 
Y as a function of the experi- 
hich were unknown to the sub- 
ous emphasis on е 


| тте Xperimenters' 
ardization, and nondifferentia] 


treatment. of 


їп the socia] sciences 
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carrying out this research? Might certain investigators typically conclude 
"no difference" because they tend to dilute the effects of treatment con- 
ditions? And conversely, might others be led to conclude "significant 
difference" by their increasing the anxiety of subjects known to belong 
to the “more anxious condition" of an experiment? Clearly, these are 
not necessarily effects of experimenters’ expectancies, but they are effects 
of experimenter attributes which may have equally serious implications 


for how we do research in the social sciences. 


IX. The Effect of Early Data Returns 


We have been discussing the effects of experimenters’ expectancies 
on the data they obtain from their subjects. We shall discuss here the 
effect of the data experimenters obtain upon their expectancies. Except 


in the most exploratory or heuristic experimental enterprise, the experi- 
menter's expectancies are likely to be based upon some sort of data. It 
al data. It may be based upon a nearly 


need not be formal, experiment 
Casual observation of behavior. The observation need not even be the 
experimenter's own. If data are in any case the most likely determinant 
of experimenter expectancies, then we may fairly ask: What about the 
data obtained early in an experiment? What are their effects upon data 
subsequently obtained within the same experiment? 

That the “early returns” of psychological research studies can have 
an effect on experimenters’ expectancies was noted and well discussed by 


Ebbinghaus (1885). After saying that investigators notice the results of 
, "Consequently it it unavoidable 


their studies as they progress he stated dio 
that, after the observation of the numerical results, suppositions should 
arise as to general principles which are concealed in them and which 
Occasionally give hints as to their presence. As the investigations are 
Carried further, these suppositions, as well as those present at the begin- 
ning, constitute a complicating factor which probably has a definite in- 
fluence upon the subsequent results" (p. 98). He went on to speak of the 
pleasure of finding expected data and surprise at obtaining unexpected 
data and continued by stating the hypothesis of the present studies: 


Where “average values" were obtained initially, subsequent data would 
tend to be less extreme and where "especially large or small numbers are 
expected it would tend to further increase or decrease the values" (p. 29). 

Ebbinghaus was, of course, speaking of himself as both experimenter 


and subject. Nevertheless, on the basis of his thinking and of the reason- 
ing described earlier, it was decided to test Ebbinghaus's hypothesis of 


the effect of early data returns on data subsequently obtained by experi- 


menters. 
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Two experiments were conducted. In the first of these studies (Rosen- 
thal et aL, 1963c) 12 biased experimenters, each running six subjects 
on the photo-rating task, were equally and randomly divided into three 
treatment conditions. One group of experimenters obtained “good” or 
expected data from their first two subjects (who were actually accom- 
plices), another group of experimenters obtained “bad” or unexpected data 
from their first two subjects (who were also accomplices), while the third 
group, utilizing only naive subjects, served as a control. Comparisons 
were made of the mean data obtained by experimenters from the last four 
naive subjects run. 

Results indicated that experimenters obtaining “good” initial data 
also obtained good subsequent data. Experimenters obtaining "bad" ini- 
tial data obtained bad subsequent data. 

In the second study (Rosenthal, Kohn, Marks, and Carota, 1963a), 
26 experimenters each running about six subj 
opposite experimental expectancies. By the 
the earlier study, half the experimenters had t 
by their first two subjects and half had their 
Within each of the conditions described, h 
praised for their experimental technique 
of this study were more complex than th 
showed that experimenter expectancies 
of experimental results when they 
turns. This effect depended, how; 
tor who administered the praise 


ects were given one of two 
same method employed in 
heir expectancies confirmed 
г expectancies disconfirmed. 
alf the experimenters were 
and half were reproved. Results 
ose of the first experiment, but 
were most clearly determinants 
were confirmd by the early data re- 
ever, on the personality of the investiga- 
or reproof to the experimenter. 
X. The Mediation of Experimenter Expectancy Effects 
Granting the occurrence and some 
pectancy effects, how do they operate? Cheating cannot reasonably account 
for the observed effects since at least those instances of cheating of which 
we have become aware tended on the whole to diminish the biasing effect, 
as when experimenters who believed their rat subjects to be dull prodded 
them, and, in a very few cases, presented fraudulent data, Experimenters' 
data recording and computations were checked and were found to be so 
accurate that, in general, written recording and computational errors 
could not be reasonably implicated as agents of expectancy effects. If 
neither cheating nor honest errors could account for our findings, what 
might? 


generality of experimenter ex- 


A. VERBAL CONDITIONING 


The most obvious hypothesis seemed to be some form of verbal 
conditioning. If an experimenter expects to obtain high ratings of photo- 


graphs might he not subtly reinforce this type of response? Conversely, 
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if he expects low ratings of photos, might he not reinforce subtly those 


responses which are low? He might be capable of this system of subtle 
reinforcement even without any implication of dishonesty, for it might 
be an unintended response on his part. Fortunately, we were able to test 
this hypothesis. If indeed verbal conditioning were mediating the phe- 
nomenon, we might expect to find that expectancy effects increase as a 
function of the number of photos rated. Certainly, we would not expect 
to find any biasing on the very first photos rated by subjects run by dif- 
ferent groups of experimenters. There had, after all, been no reinforce- 
ment possible prior to the very first response. In a test of this hypothesis 
(Rosenthal et al., 1964) we found, if anything, that biasing decreased 
over the course of the photo ratings. Furthermore, there was a significant 
biasing effect in evidence on the first photo alone, thus ruling out verbal 
conditioning as a necessary mediator or even аз an augmentor of the 
phenomenon. An important implication of this finding is our need to pay 
Special attention in our search for the mediators of the expectancy bias 
phenomenon to the brief pre-data-gathering interaction during which the 
experimenter greets, seats, "sets," and instructs tbe: subject. | 

A subsequent study (Fode, Rosenthal, Vikan-Kline, and Persinger, 
1961) was conducted to learn whether operant conditioning could drive 
the ratings of photos up ОГ down according to the will of the experi- 
menter, Results showed clearly that this was possible, and that it worked 
best with certain types of subjects. We may therefore conclude that, while 
verbal conditioning is neither а necessary nor necessarily frequent ante- 


cedent of biasing, it nevertheless could be. 


B. Mopaurrv ОЕ CUE COMMUNICATION 
erbal cues, such as tone, more important for the 
ode (1960) studied this question by using a group 
of experimenters behind screens to eliminate visual cues, and a group 
of experimenters who remained silent throughout the experiment to 
eliminate verbal cues. He found that verbal cues of tone are probably 
sufficient to mediate expectancy effects but that they can be greatly aug- 
mented by visual cues. Restriction of visual cues accounted for about 


80% of the variance of bias m 


Are visual or v 
mediation of bias? F 


agnitude. 


C. PERSONAL CHARACTERISTICS OF EXPERIMENTERS AND SUBJECTS 
able to suppose that it is the behavior of the experi- 
menter in the experimental situation that determines the occurrence and 
magnitude of expectancy effects. Without knowing exactly what occurs 
in the experimenter-subject interaction we can only posit that certain 
personal characteristics of experimenters are likely to be associated with 
different behavior vis-à-vis the subject in the experimental situation. As 


It seems reason 
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a first step, 
characterist 
effects. 
Experimenter's need for approval 4 
Marlow-Crowne Social Desirability Scale (M-C SD) appears to predict 
expectancy effects in a fairly reliable way. In seven samples (Persinger, 
1962; Rosenthal, 1963d; Rosenthal €t al., 1963b), experimenters' need for 
approval was correlated with the magnitude of their expectancy effects. 
For a total of 57 experimenters of medium anxiety level (or unselected 
for anxiety) and previously not acquainted with their subjects, the ob- 
tained correlations averaged .64 (p — 001). A recently completed study, 
however, yielded contradictory data (Rosenthal et al., 1963a). In that 
study experimenters with higher need for approval scores showed less 
expectancy effects (р=.04). Disregarding that finding for the moment, 
it appears that experimenters who may have a greater need to please 
the principal investigators obtain more of the sort of data they are led 
by him to expect; at least when their anxiety level is average. For a total 
of 50 experimenters scoring either very high or very low on the Taylor 
Manifest Anxiety Scale (MAS) the correlations between need for approval 
and expectancy effects tended to be negative though not significantly so. 
In none of our studies have we obtained a significant relationship between 


subjects’ need for approval and their Susceptibility to experimenter ex- 
pectancy effects. 


Experimenter’s anxiety 1еуе 
pectancy effect in a remarkably inconsistent manner, 
ments, experimenters with medium levels of anxi 
expectancy bias (Fode, 1963; Rosenthal et al., 
1963b). In two experiments, 
showed the greatest expectancy bias 
et al., 1963e). Finally one experiment 
in anxiety to have the greatest expec 
confusion of findings is matched by th 
tween subjects' level of anxiety and their sus 


therefore, we have tried to find some more enduring personal 
ics associated with greater or lesser experimenter expectancy 


as measured primarily by the 


1 (MAS) has been found related to ex- 
In three experi- 


anxiety 


» 1962; Rosenthal 
showed experi: 


tancy bi 


4 n 1 xious subjects were 
most susceptible (Fode, 1963; Vikan-Kline, 1962). In two other studies 


highly anxious subjects were most susceptible (Rosenthal et al., 1963e; 
Rosenthal et al., 1963b). In a single study, least anxious subjects were found 
most susceptible (Persinger, 1962) and in another, medium anxious sub- 
jects were found least susceptible (Rosenthal et al., 19632). Mietin only 
conclude that experimenters’ and subjects’ anxiety level are related sig- 
nificantly to expectancy effects in an as yet indeterminate manner, 

In most of our studies we have employed primarily male experi- 
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menters. In three experiments enough female experimenters were in- 
cluded to allow us to compare the magnitude of expectancy effects for 
male and female data collectors. In all three cases male experimenters 
showed the greater expectancy effects (Marcia, 1961; Persinger, 1962; 
Rosenthal et al., 1963b). It may be that the culturally prescribed relative 
assertiveness of males may make them more successful sources of unin- 
tended social influence. Again on the basis of role prescription we might 
expect female subjects to be more susceptible to the unintended social 
influence process implied by expectancy effects in the experimental 
situation. In several studies we found no difference between male and 
female subjects in this regard. In three experiments where significant 
differences were obtained, however, we did find female subjects more sus- 
ceptible to expectancy effects (Persinger, 1962; Rosenthal et al., 196b; 
Rosenthal et al., 1963c). These findings were heartening in their con- 
sistency within themselves and with the general, analogous findings in the 
more usual literature on susceptibility to interpersonal influence processes. 
anceship between male experimenters and their sub- 
jects seemed to facilitate the expectancy biasing phenomenon. In one 
experiment this appeared to be true for both male and female subjects 
(Rosenthal et al., 1963e), but in a second study (Persinger, 1962) it ap- 
peared true only for female subjects. The perceived status of the experi- 
menter was also found to be related to his degree of biasing. Vikan-Kline 


(1962) found higher-status experimenters better able to bias their subjects’ 


responses. 
If, on the basis of the data av 


Prior acquaint 


ailable to date, we were forced to de- 
scribe the paradigm fostering maximal experimenter expectancy effects, 
ate an experimenter with a high need for social approval 
vel neither very high nor very low. The experi- 
menter would have high status, be gesturally inclined, and behave in a 
friendly, interested manner vis-à-vis his subjects. Subjects might best be 
eir experimenter and be female rather than male. 

ribed might be understood best by considering the 


experimental dyad as à signal exchange system. The signals under dis- 
of course, unintentional. Experimenters high in need for 


Social approval may typically be more precise in their signaling behavior. 
The business of impression management (Goffman, | 1956), or signal 
editing, is more important to them in their everyday life. Their motiva- 
tion for biasing may also be greater because of their need to please the 
source of the hypothesis. The high status and friendly manner may serve 
to focus subjects’ attention onto the signal source and increase the likeli- 
hood of the experimenter's unintentional message being understood. 


we would postul 
and with an anxiety le 


acquainted with th 
The pattern desc 


cussion are, 
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XL Methodological Implications of Studies of Expectancy Effects 


It seemed reasonable to conclude from our findings that systematic 
expectancy effects might be eliminated by employing as data collectors, 
research assistants who did not know the experimenters’ hypothesis or 
expectancy. Not only did this technique seem logically implied by our 
data, but it would be practical as well. 
actually carried out by research 


More and more data collection is 
assistants. We decided, however, to test 
the soundness of this methodological suggestion (Rosenthal et al., 1963e). 

We began by conducting a by now 
experimenter expectancy effects. Fourteen experimenters ran a total of 
76 subjects in the photo-rating task with half the experimenters led to 
expect +5, and half led to expect —5 mean ratings of the success of per- 
sons pictured in photos. At the conclusion of this experiment, each experi- 
menter was awarded a “research grant” from which he could draw a small 
salary and also hire two research assistants. Assistants were randomly as- 
signed to experimenters who then trained and supervised their two assist- 
ants. Each of these then ran 5 or 6 randomly assigned subjects of his own. 
Unlike the original instructions to experimenters, instructions to their 
assistants did not inform them as to what perceptions to expect from their 
subjects. Experimenters, however, were subtly led, by their printed in- 
structions, to expect their assistants to obtain data of the same sort they 
had themselves obtained from their own subjects. 

Experimenters biased their subjects, and assistants in turn biased 
their subjects. The correlation between magnitude of experimenters’ 
bias and their respective pair of assistants’ bias was .67 (р = .01). Ap- 
parently, the experimenters' hypothesis or expectancy may be com- 


municated to his research assistants without the assistants being told the 
nature of the hypothesis or expectancy. 


What methodological suggestions rem 
to reduce or eliminate experimenter expect 
in which it is possible to do so, the experimenter might eliminate himself 
and his surrogates from the interaction with subjects, 
make this feasible for some kinds of behavioral research, but not for 
others. Any technique of instruction of data collectors by the principal 
investigator which would eliminate the possibility of the subtle com- 
munication of his expectancies would be a methodological improvement. 
This would be no easy matter and no perfect solution, The too frequent 
failure of the double-blind method in medical research attests to this. 
It is a failure not of “double-blindness” but of maintenance of “blind- 
ness." During the experimenter-subject interaction each may 
much about the other to insure "blindness-maintenance." 


fairly standard. experiment. in 


ain then which might serve 
ancy effects? For those studies 


Automated setups 


learn too 
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Not only because of the danger of bias, but also because of the 
general nature of experimenter differences, it would be desirable to em- 
ploy samples of experimenters drawn as randomly as possible from a rele- 
vant population of relevantly uninformed experimenters. Following 
Brunswik (1956), this would greatly increase the generality of all our 
research findings and thus be of benefit even if no bias were ever oper- 
ating. Alternatively, there may be value in employing samples of experi- 
menters with known distributions of bias as Mosteller (1944) has suggested 
in the case of interviewers. The particular biases, however, need not be 
pre-existing ones and it may be useful to purposefully induce different 
biases in our sample of experimenters, giving us better control over the 
nature and degree of experimenters’ biases. That this is a practical tech- 
nique has been demonstrated recently by Rosenhan (personal communica- 
tion), who had his own research repeated by an experimenter in whom the 
opposite hypothesis had been ingeniously induced. 

One fairly concrete implication for the design of experiments may 
emerge from our research, the addition of "expectancy control groups.” 
In any study employing an experimental (treatment) and a control (no 
treatment) condition, а group would be added for whom the experi- 
menter(s) is reasonably led to expect the same sort of data as is expected 
from the treatment group but in which the treatment 15 not administered. 
Differences between the treatment group and the expectancy control 
group might then be attributable to the treatment truly or to a treatment 


and expectancy interaction rather than to expectancy alone. 


XII. Some Substantive Implications of Studies of 


Expectancy Effects 
Perhaps the most compelling and the most general conclusion to be 


drawn is that human beings can engage in highly effective and influential 
Jed communication with one another. The 


unprogrammed and unintenc 
subtlety of this communication is such that casual observation of human 
dyads is unlikely to reveal the nature of this communication process. 


Sound motion pictures may provide the necessary opportunity for more 
leisurely, intensive, and repeated study of subtle influential communica- 
tion processes. We have obtained sound motion picture records of 29 ex- 


perimenters each interacting with several subjects. Preliminary analyses 
pe that we may be able to learn something of 


have given us cause to ho 
ancy effects in particular, and 


consequence about the mediation of expect 


about subtle communication processes in general. In these films, all ex- 
ords to their subjects so that the burden of 


perimenters read identical w 
essions, and intonations which 


communication falls on the gestures, expr 
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accompany the highly programmed aspects of experimenters' inputs into 
the experimenter-subject interaction. 


Some interesting practical questions arise from these considerations. 
When an experienced physician or psychotherapist tells the neophyte 
therapist that the neophyte's patient has a poor or good prognosis, is the 
experienced clinician only assessing or is he actually "causing" the poor 
or good prognosis? When the master teacher tells his apprentice that а 
Pupil appears to be а slow learner, is this prophecy then self-fulfilled? 
When the employer tells the employee that a task cannot be accomplished, 


has the likelihood of its accomplishment thereby been reduced? More sub- 
tly, might these phenomena occur even if the supervisors never verbalized 
t that they may. 


their beliefs? The data cited sugges 
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I. Introduction 

n of the early work of Pavlov (1927), which 
described marked individual differences in the conditioning performance 
of animals, the method of conditioning has held promise of elucidating 
basic parameters of human personality functioning. Early attempts to 
apply the method to human subjects encountered serious methodological 
difficulties. Apart from such technical problems as the measurement of 
the output of saliva in salivary conditioning of human subjects, it be- 
came apparent that the central factors of "attitude," "set," or “ехрес- 
tancy" are important sources of variability in human conditioning. The 
conditioning of other responses, such as the GSR, brought new problems 


of measurement. It is probably true, however, that the most significant 
ather than methodological. In the 


ng period in which little advance 
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was made on Pavlov's thinking in relation to the problems of individual 
differences in conditioning. In the Western world this aspect of Pavlov's 
theorizing had little impact, and there were no alternative theories which 
might have systematized the findings of existing research, and pointed 
the way to significant areas of future study. 

The resurgence of Western interest in problems of conditioning 
and personality over the past decade or so seems to have resulted chiefly 
from the development of two psychological theories of relevance to prob- 
lems of personality. These are the theories of Hull (1943, 1952) and 
Eysenck (1957). Between them these two theories have inspired the 
majority of Western studies of conditioning in relation to personality 
functioning over the past decade. Most of the remaining Western in- 
vestigations have been largely empirical in character. In the USSR, in- 
terest continues in the problem of developing Pavlov's typological con- 
ceptions. 

А review of recent research suggests that studies of personality dyna- 
mics through conditioning may most conveniently be grouped under two 
main headings. First, there are the studies which begin with groups dif- 
ferentiated on the basis of personality characteristics, and proceed to 
examine the relationship between these personality characteristics and 
parameters of conditioning performance. Possibly the most important 
subgroupings within this class of investigation may be defined in terms of 
the theoretical bases from Which the studies derive. Thus the study may 
be designed to test predictions from a theory of personality of wide 
generality, it may be concerned with predictions derived from a theory 


of limited aspects of personality functioning, Or it may be unrelated 
to systematic theory of any sort. 


In the second group of studies, tect 
in the process of defining major personality variables, V 
vestigations of personality and conditioning in Western 
into the first major class, whereas the most 
USSR belong to the second major grouping. 

A further development leading to renewed interest in 
conditioning relations has been the application of 
methods in the therapy of behavior disturbances, T 
good deal of its impetus from the application of co 
in the study of experimental neuroses. Whereas th 
considered are concerned with understanding personality dynamics, in- 
vestigations of experimental neuroses and behavior therapy аге concerned 
with changing certain aspects of personality functioning, 

An examination of the literature on personality and с 
quickly reveals the extent to which further advance is depend 
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on the solution of basic issues in conditioning as on conceptual develop- 
ment in relation to personality functioning. For this reason it has been 
found necessary to devote a good deal of attention in the following pages 
to problems of methodology, and to such fundamental issues as the criteria 
of conditioning. 

Although the procedures of classical conditioning of involuntary 
are often referred to as "simple conditioning," it is becoming 
evel of behavioral modification, we are 
confronted with sufficient complexity to make interpretation far from 
straightforward. Since the study of the instrumental conditioning of 


voluntary responses in human beings (eg. verbal conditioning) intro- 
duces further complexity, it has been considered wisest to place exclusive 
age. In any case, classical con- 


emphasis on classical procedures at this st 
ditioning studies account for the great majority of the investigations of 


personality and conditioning which have been carried out to date. 

Recently Diamond, Balvin, and Diamond (1963) have declared in 
favor of a return to the use of the terms “conditional” and “unconditional” 
in place of “conditioned” and “unconditioned” reflexes. These authors 
point out that “the context of Pavlov's discussion never leaves any doubt 
that the meaning he wished to convey is that of the words conditional 
and unconditional. He spoke of the conditional reflexes as ‘temporary 
connections,’ and he emphasized that they lack the certainty and regular- 
ity of occurrence of innate reflexes which can, by contrast, be called un- 
conditional; that is, the reaction is sure to occur in response to the ade- 
quate stimulus despite other circumstances and conditions" (Diamond 
€t al., 1963, p. 183). Following the lead of these writers, the terms condi- 
tional and unconditional will be used in the present paper in place of 
the more conventional conditioned and unconditioned. 


responses 
increasingly clear that, at this 1 


s of Personality Dynamics 


RELATIONS BETWEEN INDEPENDENTLY DEFINED PERSON- 
AND PARAMETERS OF CONDITIONING PERFORMANCE 


IL Investigation 


A. SrUDIES OF 

ALITY VARIABLES 
1. Tests of Predictions from Personality 
e theory of personality which has given rise 
to explicit predictions concerning the differential conditioning perform- 
listinguished on the basis of personality is that of Eysenck. 
allotted a crucial role in Eysenck's theory. In the develop- 
ment of his theory, Eysenck began with a study of the problems of taxon- 
omy or classification. At this stage the question asked was, "What are 
the major dimensions of personality with respect to which persons vary?" 
The answer, proposed on the basis of previous findings and original 


Theories of Wide Generality 
The only comprehensiv 


ance of groups d 
Conditioning is 


118 S. Н. LOVIBOND 


research, was that most of the variance in personality 
accounted for in terms of the three orthogon 
icism, neuroticism, and introversion-extraversion. Psychoticism is defined 
аз a predisposition to develop such symptoms of mental disorder as delu- 
sions, hallucinations, mood disturbances, motor retardation, and the like. 
Neuroticism is identified with emotionality or lability of the autonomic 
nervous system, which is considered to act as a predisposition to neurotic 
disorders. The introversion-extraversion dimension is defined in terms of 
a wide range of behaviors. The behavior of introverts is characterized 
by a relative lack of sociability, high persistence, high level of aspiration, 
an emphasis on accuracy rather than speed, reliance on inner standards 
of conduct, and a stress on moral scruples. Extraverts, on the other hand, 
are sociable, impulsive, dependent on the social valuations of others, low 
in level of aspiration, and tough minded in their attitudes. Following 
Jung, Eysenck proposes that hysteria is the Syndrome to be found in the 
extraverted neurotic, whilst dysthymia (syndrome characterized by an- 
xiety, reactive depression, and/or obsessive-compulsive features) is typ- 
ically found in the introverted neurotic. 

In seeking likely neurophysiological mech 
differences between extraverts and introverts, 
individual difference and typological postulates in terms of excitation- 
inhibition balance. The essence of these postulates is that individuals in 
whom excitatory potentials are generated quickly and strongly, and in 
whom inhibitory potentials are generated slowly and weakly, will tend 
to be introverted in personality, and to develop dysthymic disorders in 
the case of neurotic breakdown. Conversely, individuals who generate 
weak excitatory potentials slowly, and who generate strong inhibitory 
potentials quickly, tend to be extraverted in personality, and to develop 
hysterical-psychopathic disorders in case of neurotic breakdown. 

A link between excitation-inhibition balance and (a) the person- 
ality patterns of introversion versus extraversion, and (b) hysterical- 
psychopathic disturbances versus dysthymia, is provided by conditioning. 

In brief, a key difference between extraverts and introverts is the 
degree of socialization which is typical of each. Socialization, or the estab- 
lishment of social controls over egoistic impulses, is mediated by condi- 
tioning. Because of their rapid and strong development of excitation and 
their weak tendency toward the development of reactive inhibition, intro- 
verts condition well, and hence tend to become Oversocialized, Conversely, 
the slow development of weak excitatory potentials, and the rapid and 
strong development of inhibitory potentials, makes extraverts condition 
poorly. As a consequence extraverts tend to be undersocialized. 

Strong autonomic-emotional lability and reactivity 


functioning can be 
al dimensions of psychot- 


anisms of the personality 
Eysenck formulated his 


produce excessive 
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fear reactions to painful stimuli in all persons high on neuroticism, but 
in the introverted neurotic, the strong capacity for conditioning causes 
these fear reactions to become attached to a multitude of fortuitous stim- 
uli, thus producing the excessive anxiety reactions of the dysthymic. The 
dysthymic, then, is characterized by oversocialization and excessive an- 
xiety, whilst the hysteric is characterized by undersocialization and 
autonomic lability without excessive conditional anxiety. 

It is clear that Eysenck's theory postulates a general factor of condi- 
tionability or acquisition of CR's. It also leads to the prediction that 
introverted normals will condition more rapidly than extraverted normals 
with any reliable conditioning procedure. Furthermore the theory pre- 
dicts that neurotic introverts (dysthymics) will condition better than 
unselected normals, and that neurotic extraverts (hysterics апа psycho- 
paths) will condition less well than unselected normals. There is no 


suggestion, however, that neuroticism, anxiety, or emotionality as such 


will be related to conditioning performance. 
The first experiment designed to test these hypotheses was carried 


out by Franks (1956), who used the eyeblink conditioning method with 
90 normals, and 20 dysthymics. In accordance with predic- 
tion, the groups were ordered dysthymics, normals, and hysterics in terms 
of rate of conditioning and resistance to extinction. In a later experiment, 
Franks (1957) used 55 normal undergraduate $s who had been given the 
Maudsley Personality Inventory (MPI). This inventory included an extra- 
version (£) scale. Eyeblink conditioning was again used, and the correla- 
tion between conditioning and extraversion was —.46 for acquisition, and 
7.94 for extinction. 

More recent studies of eyeblink conditioning in relation to scores on 
the E scale of the MPI have mostly produced relationships in the direction 
predicted by Eysenck, but few of the correlations have reached statistical 
significance. Willet (1960) used the Fisher Z transformation to obtain an 
estimate of the true correlation between eyeblink acquisition score and 
the E scale. The estimate, which was based on the studies of Franks 
(1957), Das (1957), O'Connor (1959), and Willet (1960), was —.188. Since 
Willet's publication, further investigations of the relation between the E 
scale and eyeblink conditioning have included those by Barendregt (1961), 
Field and Brengelmann (1961), and Sweetbaum (1963). Barendregt re- 


Ported a correlation of —.29 but Sweetbaum obtained a correlation of 
ann found that their introverted 55 condi- 


20 hysterics, 


near zero. Field and Brengelm. 
toned insignificantly faster than their extraverted 5s. It would appear 


that the results of the later investigations agree with those cited by Willet 
(1960) in suggesting that the relationship between the Е scale and eye- 
blink conditioning is best represented by a correlation of —.2. 
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Several investigators have studied the relationship between extra- 
version questionnaire measures and GSR conditioning. Becker (1960) and 
Becker and Matteson (1961) twice presented a list of 104 words spaced 
at 5-sec intervals. The word "repeat" occurred 20 times followed bya 
.L-sec shock on 12 trials. Using two different measures of conditioning, 


Ss with introverted scores on Guilford's R scale were not differentiated 


from Ss with low scores on the scale, in either experiment, despite the 


fact that in the second experiment extreme scoring groups were selected. 
Vogel (1960, 1962), however, using a very similar GSR method, ob- 
tained highly significant differences in conditioning performance between 


extraverts and introverts. Lovibond (1963a) also used the GSR in a study 


of conditioning in 63 university students for whom E-scale scores were 


available. There were two conditioning sessions. In one session the US 
was electric shock and in the other the US was 


provided by pictures of 
nude females. The correl 


ation between E score and a measure of rate of 
aversive conditioning was —.05. The correlation for positive reinforce- 
ment was —.10. 

A change of conditioning technique appears to leave the correlation 
between the £ scale and conditioning performance essentially unchanged, 
and it is clear that the amount of common variance between the two 
types of measures is small. Possible reasons for the failure to demonstrate 
a stronger relationship, other than invalidity of the theory include un- 
reliability of the conditioning measures, and inadequacy of the E scale 
as a measure of extraversion. By and large it is clear that the reliability 
of both eyeblink and GSR conditioning measures is reasonably satisfac- 
tory (Becker, 1960; Franks, 1957; Lovibond, 1963b; Welch and Kubis, 
1947). 

It will be suggested in Section III, D,2 that the validity of GSR con- 
ditioning measures can often be questioned, but the arguments advanced 
there do not apply to the Becker (1960) experiment, nor to Povibondis 
(1963a) positive reinforcement measure, both of which failed to produce 
an appreciable correlation with E scores. Willet (1960) has аг E that 
the Æ scale is a very poor measure of extraversion, апа cites yes of 
Claridge which showed that careful behavioral ratings produced: ncs M 
relationships whereas questionnaire scores did not, Eysenck (Бурап and 
Claridge, 1962) has recently recognized that the Е Scale is not unidimen- 
sional, and in fact measures a "behavioral" extraversion factor aswell dš 
a “constitutional” extraversion factor, the former being ысу i 
predictions from the original theory. Whilst a purified E scale nas im: 
prove the position, it is likely that questione Studies of personality- 
conditioning relationships have reached the limit of their fruitfulness 
It would now seem advisable to concentrate research effort on the devel- 


PERSONALITY AND CONDITIONING 121 


opment of a set of rating scales along the lines of those developed by 
Olson (1930) for the study of behavior disorders in children, and to use 
these scales in conjunction with objective tests of excitation-inhibition 
balance (Eysenck and Claridge, 1962). 


2. Tests of Predictions from Theories of Limited Aspects of Per- 
sonality Functioning. 


a. Anxiety and Conditioning. The personality variable which has 
been most studied in relation to conditioning performance is anxiety. 
Since the majority of personality theories had their origins in the field 
of abnormal psychology, where anxiety is a pervasive problem, it is 
understandable that anxiety has become a central personality concept. 
As Jensen (1958, p. 313) has put it, "Few, if any, other concepts sustain 
as much of the superstructure of personality theory.” 

The early studies of anxiety and conditioning were largely empirical, 
but since developments of Hull's general behavior theory gave rise to 
testable predictions concerning the relation between anxiety and con- 
ditioning, interest in this area of investigation has increased markedly. 
Much of the recent work has been carried out by the group at Iowa 
(Spence, 1954; Spence and Taylor, 1951, 1953; Spence and Farber, 1953; 
Taylor, 1951, 1956). 

The aspects of Hullian theory which are of relevance to the relation 
between conditioning and anxiety have been stated by Taylor (1956) 
as follows: 

“According to Hull, all habits (Н) activated in a given situation, 
Combine multiplicatively with the total effective drive state (D) operating 
at the moment to form excitatory potential or E [E = f (H x D)]. Total 
effective drive in the Hullian system is determined by the summation of 
all extant need states, primary and secondary, irrespective of their 
relevancy to the type of reinforcement employed. Since response strength 
18 determined in part by £, the implication of varying drive level in any 
Situation in which a single habit is evoked is clear; the higher the drive, 
the greater the value of E and hence of response strength. Thus in 
simple, noncompetitional experimental arrangements involving only a 
Single habit tendency, the performance level of high drive Ss should be 
Breater than that for low drive groups." 

In a series of experiments designed to test this prediction from Hull's 
theory, the Iowa group chose classical conditioning of the eyeblink as 
‘a simple, noncompetitional experimental arrangement involving only 
a single habit tendency.” In the search for a drive which could readily 
be varied in strength, Spence and his associates made the assumption that 
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anxiety possesses the energizing and reinforcing properties attributed to 
drive in Hullian theory. 

The next step was to provide a means of manipulating the strength 
of the anxiety drive. There are three principal procedures which experi- 
menters have used for this purpose. The first selects Ss who manifest 
symptoms of anxiety in different degrees by means of clinical observation 
or responses to questionnaires. The second induces anxiety experimen- 
tally by means of ego threat (e.g., failure in a competitive situation) or 
threat of painful stimulation (eg., electric shock). The third method 
makes use of real life situations considered to be productive of anxiety 
(imminence of examinations or surgical operations). 

The most favored procedure has been the first named, and most ex- 
periments have made use of a specially devised questionnaire, the Manifest 
Anxiety Scale (MAS). The MAS was constructed by asking clinicians to 
select from a pool of personality questionnaires those items, the endorse- 
ment of which would be indicative of manifest anxiety. Although the 
rationale of this procedure has not been discussed in any detail, it is 
clear that when the MAS was constructed, it was assumed that high 
scorers on the scale were in a chronic state of high anxiety drive. 

A sufficient number of studies has now been carried out to indicate 
that MAS scores are positively related to eyeblink conditioning per- 
formance (Baron and Connor, 1960; Spence and Farber, 1953; Spence, 
Farber and "Taylor, 1954; Spence and Taylor, 1951, 1953; "Taylor, 1951). 


The strength of this relationship is represented by a correlation of about 


+.25. In other words, differences in MAS score account for about 6% of 
the variance in eyeblink conditioning performance. 

Bitterman and Holtzman (1952), Gilberstadt and Davenport (1960), 
and Lacey, Smith, and Green (1955), were unable to demonstrate а 
significant relationship between MAS scores and GSR conditioning. More 
recently, however, Becker and Matteson (1961), using extreme scores on 
the MAS, found that the rate of acquisition of a conditional GSR was 
substantially greater in anxious Ss than in nonanxious Ss. The relatively 
low correlations obtained between the MAS and conditioning, as well 
as the not infrequent failure of scores on the scale to correlate as pre- 
dicted with complex learning performance, has led to a more critical 
attitude toward the MAS, and a 
manipulating the anxiety variable. 
anxiety itself has undergone some 
agreed that the MAS 
ness to defensive 


tendency to use other methods of 
At the same time the conception of 
change. It is now fairly generally 
does not measure present defensive arousal, prone- 


arousal, or chronic, persisting tension, but rather S's 
tendency to react to anxiety in a way “characteristic of the psychasthenic 
neurotic” (Jenkins and Lykken, 1957). There scems to have been a 
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definite shift toward the conception of anxiety as sensitivity to defensive 
arousal or reactivity to threat. Anxiety thus becomes a disposition rather 
than a state, but it is still considered to have drive properties when 
activated by threat or stress. This shift in emphasis in the definition of 
anxiety has resulted in part from the greater frequency of significant 
findings in tests of drive theory when threat has been experimentally 
induced. For example, Beam (1955) made use of the real life stress pro- 
duced by impending doctoral examinations in a study of GSR condition- 
ing. Under stress, conditioning was significantly faster. Sweetbaum (1963), 
in a similar type of study, relied on impending surgical operation to 
induce anxiety. In this case the response conditioned was the eye blink. 
Subjects under stress conditioned very much more rapidly than did con- 
trol Ss who had made a successful recovery from surgery. Willet (1963) 
compared the eyeblink conditioning performance of youths taking part 
in a stressful selection procedure with that of controls who had already 
been selected. The acquisition of the CR. was significantly faster in the 


experimental group. 

Experiments which have made use of clinical ratings of anxiety 
have also tended to produce more significant results than studies using 
the MAS. Some of the earlier experiments of this type were not prompted 
by the theory of anxiety as a drive, but because of their relevance for an 
evaluation of this theory, they are mentioned here. Welch and Kubis 
(1947) investigated the effects of pathological anxiety on the rate of GSR 
conditioning. A group of 51 mixed psychiatric patients was rated for 
anxiety by psychiatrists. The CS was one of 54 nonsense syllables pre- 
sented at a rate of one per 6 sec. The US was the sound of a buzzer. The 


conditioning score was number of trials required for 3 successive GSR's 
ol syllables. АП but 4 of 36 


to the CS greater than the GSR's to contr 
normals scored from 14 to 58, whilst, with one exception, all patients 
diagnosed as having anxiety had a score of 14 or less. The retest correla- 
Чоп on 36 normals was 4-88. Other investigators who have found 
clinically assessed anxiety to be positively related to GSR conditioning 
itterman and Holtzman (1952), Gilberstadt and Daven- 


Performance are В ‘ 
, and Welch (1949). No failures to find 


Port (1960), and Schiff, Dougan 
Such a relationship appear to have been reported. A 

Altogether there is a considerable body of evidence that psychiatric 
Patients suffering from anxiety, and normal individuals subjected to 
Stress, condition more rapidly in experiments using aversive reinforce- 
Ment than do persons not suffering from anxiety. The theory of Spence 
and Taylor, however, predicts similar findings from nonaversive condi- 
According to the theory, anxiety as an irrelevant 


toning experiments. : б Е 
h relevant drive to increase performance in 


drive should summate wit 
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this sort of situation also. In order to test this prediction, Bindra, 
Paterson, and Strzelecki (1955) undertook a salivary conditioning experi- 
ment with high and low scorers on the MAS. The performance of the 
anxious and nonanxious Ss was virtually identical. Bindra et al. inter- 
preted their results as favorable to the view of anxiety as a response to 
threatening situations. The findings of Bindra et al. have been accepted 
uncritically into the literature as indicating that anxious and nonanxious 
Ss perform equally well in classical conditioning experiments involving 
no threat (Eysenck, 1957; Franks, 1960; Jenkins and Lykken, 1957; 
Kimble, 1961; Stewart, Winokur, Stern, Guze, Pfeiffer and Hornung, 
1959). The Bindra experiment, however, is open to such serious meth- 
odological criticism (see Section III, D, 3) that its results cannot be 
accepted as evidence for or against any theory. Hence the issue of whether 
or not anxiety increases performance in nonaversive conditioning must 
be regarded as still open. 

An alternative to the general drive theory of the relation between 
anxiety and conditioning can be derived from recent work in the field 
of electrophysiology and Pavlovian theory. As Malmo (1958) has argued, 
the concept of general drive or D may be regarded as identical with the 
neurophysiological concept of “nonspecific arousal.” Since the pioneer- 
ing work of Moruzzi and Magoun (1949), nonspecific arousal has been 
scen as the chief function of the brain stem reticular formation. It is now 
recognized, however, that the reticular formation can no longer be 
regarded simply as a nonspecific arousal system, the function of which is 


merely that of an excitatory amplifier. The work of such investigators as 


Kaada and Johannsen (1960) has made it abundantly clear that the 
reticular formation selectively activates some areas of the brain whilst it 
inhibits others. As Diamond et al. (1963, p. 228) put it, the work on the 
reticular formation assumes “its full significance only with the recogni- 
tion that multiple inhibitory effects, exerted both upwards and down- 
wards, are essential to all behavior.” 

Microelectrode and intracranial chemical stimulation techniques 
have demonstrated the existence of “centers,” or functional systems 


associated with unconditional reactions of various types, and have con- 
firmed Pavlov’ 


s view of reciprocal inhibitory relationships between these 
centers. The experiments of Wyrwicka and Dobrzecka (1960) on goats 
confirmed the hypothesis of Anand and Brobeck (1951) that the lateral 
hypothalamus contains a “feeding center” and the ventromedial nucleus 
a “satiation center.” Considering this and related experiments, Diamond 
et al. (1963, p. 368) comment that “feeding, as an element of the 
behavioral repertoire, is embedded in several different behavior 


н ^ pairs or 
polar opposites. First, we h. 


ave the contrast between feeding and satiation 
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as originally postulated. Second, there is a conflict between feeding and 
fear, . . . third, in the feeding center itself, there is the differentiation 
between eating and drinking. When an animal chooses not to eat, there 
may be at least three kinds of motivational influence: satiation, fear, or 
felt need for liquid. It is an established fact that any one of these inhibits 
the feeding mechanism." Wolpe (1958), on the other hand, has developed 
à system of psychotherapy based on animal experiments in which recip- 
rocal inhibition of anxiety responses is effected by the activation of 
positive responses such as feeding. 

Anokhin (1961) has elaborated a conception of the selective action on 
the cerebral cortex of reticular and other subcortical nuclei in relation to 
the formation and elicitation of conditional reflexes. Summarizing the 
results of electrophysiological experiments which made “а comparative 
evaluation of the activating effect on the cerebral cortex in two bio- 
logically opposed states: defensive and alimentary," Anokhin (1961) says, 
"It may be assumed that all forms of desynchronization of slow cortical 
activity that arise during the orienting-investigatory, defensive, and ali- 
mentary reactions, although outwardly remaining to some extent similar, 
are actually physiologically entirely specific, ensuring entirely different 
biological activities . . . the activating influence on the cortex is always of 
à functionally specific character . . - only the imperfection of the electro- 
encephalographic index fostered the emergence of the ideas of a 
‘generalized,’ ‘nonspecific,’ and ‘diffuse’ activating effect of the subcortex 


on the cerebral cortex.” 


In the light of these П 
postulate that individuals in a state of strong anxiety, er 
acute, will manifest a decreased capacity for forming positive reward 


conditional reflexes. This prediction is based on the assumption that 
defensive arousal will inhibit the functionally specific arousals considered 
necessary for the formation of positive reward, or appetitive, conditional 
reflexes. No clear-cut evidence relevant to this prediction is available. 
The results of Bindra et al. (1955) are equivocal, and, although Finesinger, 
Sutherland, and McGuire (1942) have shown that neurotic Ss are able to 
form salivary conditional reflexes, no control data were obtained. 
Lovibond (1963a) compared the positive reward GSR: ойбой per- 
formance of high and low scorers on Eysenck's Nenroticism (N) scale, 
Which had previously been shown to correlate highly with the MÁS 
(Eysenck, 1957). There was an insignificant tendency tor Ss with low N 
Scores to perform at а higher level than Ss with high N scores, — 
With the redefinition of anxiety in terms of emotional sensitivity to 
it is now clear that experiments using ques- 
y criteria are of little relevance in the study of 


considerations, it would seem reasonable to 
whether chronic or 


threatening situations, 
tionnaire scores as anxiet 
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the relationship between anxiety апа appetitive conditioning. What is 
required is a study of the effects of real life stress, such äs impending 
surgical operation (Sweetbaum, 1963), on appetitive conditioning per- 
formance using either the GSR technique of Lovibond, or a methodo- 
logically acceptable technique of salivary conditioning (see Section 
III, D, 3). 

We have seen that the two theories which have generated most of 
the research into personality conditioning relations emphasize different 
personality variables as being related to differences in conditionability. 
For Eysenck, the personality dimension which is related to condition- 
ability is introversion-extraversion whilst for Spence and Taylor it is 
anxiety. We have seen also that both variables in fact appear to some 
degree to be related to aversive conditioning performance. Hence the 
question of the relation between these two variables naturally arises. It 
will be remembered that Eysenck's theory of personality postulates three 
orthogonal dimensions, introversion-extraversion, neuroticism, and psy- 
choticism. Neuroticism has been defined tentatively in terms of emotion- 
ality, or autonomic nervous system lability, and since Eysenck has in 
mind defensive emotionality and autonomic lability, it would appear 
that neuroticism and anxiety may reasonably be equated. The essence of 
Eysenck's thinking on the subject appears to be conveyed if we think of 
anxiety-neuroticism in terms of “defense arousability," i.e., sensitivity to 
arousal by aversive stimulation and threatening situations. 

Eysenck has postulated that introverts generate excitatory potentials 
rapidly and strongly. His findings that, in contrast to extraverts, introverts 
have high levels of aspiration, persistence, perceptual rigidity, and seda- 
tion threshold, support this assumption, although Eysenck has emphasized 


the differences in inhibitory potential between introverts and extraverts 
rather than differences in excitatory potential. 


A reconcilation between the views of Eysenck and Spence and the 
accumulated evidence may be effected by acceptance of the following 
propositions. 

1. The level of cortic 


al excitation, which determines the speed of 
formation of condition 


al linkages, is dependent on 

(a) the degree of subcortical arousal and 

(b) the general excitability of the cortex, i.e., the capacity of the 
cortex for developing excitation from a given level of sub- 
cortical arousal. 


2. Individuals differ with respect to three dimensions of nervous 


activity which are at least partially independent: 
(a) general excitability of the cortex, 
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(b) defensive arousability, and 

(c) appetitive arousability. 

3. The personality dimension which is related to general excitability 
of the cortex, and hence to a general factor of conditionability, is intro- 
version-extraversion, and the relationship is such that introverts will 
manifest higher levels of general cortical excitability, and condition- 
ability, than will extraverts. 

4. The personality variable of anxiety is related to level of defensive 
arousability, and a group factor of aversive conditionability. 

5. The relationship between different arousal systems is a mutually 
inhibitory one, so that, e.g., a high level of defensive arousal at a particu- 
lar time will reduce the capacity for appetitive arousal, and hence 
appetitive conditioning performance. 

6. The relationship of reciprocal inhibition between different types 
al will be to some degree asymmetrical. Because of the generally 
sal, the inhibitory effects of this type of 
y observable than the inhibitory effects 


of arous 
greater strength of defensive arou: 
arousal are likely to be more readil 
of nondefensive arousal. 


Table I sets out the predicted differences ir 
anxious introverts, anxious extraverts, 


n the aversive and appeti- 


tive conditioning performance of 
TABLE I 
PERFORMANCE OF VARIOUS GROUPS EXPECTED ON 
RELATION OF RECIPROCAL INHIBITION BETWEEN 
DIFFERENT AROUSAL SYSTEMS 


CONDITIONING THE HYPOTHESIS OF A 


Anxious (stress situation) Nonanxious (nonstress situation) 


Aversive Appetitive Aversive Appetitive 
conditioning conditioning conditioning conditioning 
General Specific General Specific General Specific General Specific 
Introvert + + + — + 0 + 0 
Extravert — pus E amd = 0 — 0 


nonanxious introverts, and nonanxious extraverts. It is important to 
us" refers here to a presently high level of defensive 
this is achieved through a high degree of defensive 
ate stress, or a moderate degree of defensive 


note that "anxio 
arousal, whether 
arousability and moder: 
arousability and a high level of stress. 

xiety dimension alone, the predictions are quite 
clear cut and readily tested. It is predicted that an unselected group 
under real life stress will condition better than nonstress controls with 
aversive reinforcement, but less well with appetitive reinforcement. We 
have seen that the evidence of Beam (1955), Sweetbaum (1963), and 


Considering the an 
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Willet (1963) is favorable to the prediction relating to aversive condition 
ing, but evidence relevant to appetitive conditioning is entirely lacking. . 

Considering the introversion-extraversion dimension separately, it : 
predicted that introverts will condition more readily than extraverts with 
both aversive and appetitive reinforcement. The only unambiguous 
studies bearing on these predictions are those which have used ques- 
tionnaires to select Ss. In all cases except one (Lovibond, 19632), aversive 
conditioning procedures have been used, and the evidence, on balance, 
favors the prediction of superior performance by introverts. In Lovibond's 
(1963a) appetitive GSR conditioning experiment, there was a tendency 
for introverts to perform at a higher level than extraverts. This per 
formance difference, although in the required direction was not signifi- 
cant. 

It was suggested in Section II, A, 1 that the selection of extraverts and 
introverts by means of personality questionnaires alone is an unsatisfac- 
tory procedure. Efforts to obtain behavioral ratings which emphasize the 
temperamental disposition factor of extraversion might well prove 
fruitful, particularly if combined with objective indicators such as level 
of aspiration, persistence, and so forth. Extraverts and introverts selected 
by these means could be equated on the 
questionnaires and measures of autonomic reaction to stress. 

Turning now to the predictions derived from the interaction of the 
two dimensions, it will be seen that, in Table I, pluses, zeroes, and 
minuses are allotted under the various headings. A plus signifies better- 
than-average conditioning expected, a zero indicates average condition- 
ing, and a minus below-average conditioning, The heading "general" 
refers to the expected influence of the assumed general factor, and the 
heading "specific" refers to the expected influence of 

Assuming equal weight for each positive 
algebraic sums of general and group f. 
order of the Broups in terms of each 
The decreasing order of predicted 
the groups is anxious introvert, no. 
nonanxious extravert. The corr 
ing is nonanxious introvert, 
anxious extravert, 


b. Conceptual Thinking Disturbances and Conditioning. From 
studies of the responses of schizophrenics and normals to various tests of 
concept formation, Lovibond (1954) concluded that the characteristic 
feature of schizophrenic thought disorder is a failure to inhibit linkages 
or associations which the context makes irrelevant. According to this 
hypothesis, in normal thinking, the context, including the experimenter’s 


anxiety dimension by means of 


defensive arousal. 
and negative tendency, the 
actor influences give the expected 
type of conditioning performance. 
aversive conditioning performance for 
nanxious introvert, anxious extravert, 
esponding order for appetitive condition- 


anxious introvert, nonanxious extravert, 
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instructions, the nature of the test materials, or the theme of a discussion, 
permits the activation of only a limited number of "relevant" linkages in 
the S's conceptual structures. The majority of the manifold linkages that 
are potentially available are selectively inhibited by a process akin to 
the active, differential inhibition manifest in conditioning experiments. 
On the basis of these assumptions, a method was developed to permit 
analysis of responses to the Object Sorting Test in terms of the in- 
essentiality of the definitions offered (Lovibond, 1954, 1964a). Further 
work with this test indicated that something like 20% of normals obtained 
high, or schizophreniclike, scores. It was concluded that certain aspects of 
schizophrenic-type thinking could conveniently be studied in these high- 


Scoring normals whose thinking was free from the complications of 


mental disorder. 
In an extension of the original theorizing, it was hypothesized that 


the differential inhibition of irrelevant linkages in normal thinking takes 
place through a process of negative induction, which necessitates a 
relatively strong process of excitation. On the basis of this analysis, a 
number of predictions was made concerning the differential conditioning 
performance of normals with high and low scores on the Object Sorting 
Test. In particular, it was predicted that high scorers would show lower 
reactivity (including weaker initial orienting reactions, and a slower rate 
of conditioning) and a lower capacity for differentiation. With use of 
positive, or appetitive, reinforcement of the GSR, the main predictions 
were confirmed. When the GSR was reinforced by electric shock in 
another session, however, the differences between the two groups largely 
disappeared. It was concluded that the stronger activation. from the 
aversive US was sufficient to mask the differences in conditioning per- 
formance between the two groups. 

This study raises the question of the relationship between the 
personality dimension tapped by the Sorting Test, and other dimensions 
known to be related to conditioning performance. The Sorting Test 
dimension would seem to be closely akin to the “cortical excitability” 
assumed by Eysenck to be one of the properties of nervous activity which 
differentiates introverts and extraverts. The finding of stronger dif- 
ferential inhibition in Ss with strong excitatory processes introduces a 
contradiction, however, since Eysenck work suggests a negative correla- 


tion between strength of excitation and strength of inhibition. Never- 


theless, it is possible that this con 


tradiction is more apparent than real. 


Eysenck has been concerned primarily with the “work decrement” type 
Of inhibition, which may be not at all the same process as the active 
assumed by Pavloy to underlie differentiation. In this 


inhibitory process й 
in differential delayed condition- 


Connection it is of interest to note that, 
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ing, when inhibition with reinforcement occurs, the response to the 
differential stimulus usually shows little further decrease, and may 
actually increase. This suggests the possibility that response decrement 
inhibition may be inimical to differential inhibition. No consistent 
correlations have been obtained between Sorting Test scores and scores on 
the MPI, but in view of the criticisms of this and related questionnaires, 


it would be unwise to conclude that the tests are measuring different 
dimensions. 


The correlation of Sorting Test scores and measures of extraversion 


based on ratings and objective tests might well produce a different 
result. 


3. Empirical Studies of Personality and Conditioning 


In a number of purely empirical studies, comparisons have been 
made of the conditioning performance of groups known to differ in 
terms of certain personality characteristics. Usually the groups have been 
distinguished on the basis of psychiatric diagnosis or symptomatology. 

The earliest reported studies of conditionin 
were carried out by Russian investigators, including Ivanov-Smolensky 
(1925), Guk (1934), and Landkof (1938). Usually very small numbers of 
subjects were used, but the invariable finding was that, 
subjects, conditional reflexes were difficult to establish 
established, the reflexes were quite unstable. The refl 
these studies included defensive, motor, 


Bender and Schilder, ( 


g in psychiatric patients 


with psychotic 
, and, when finally 
exes conditioned in 
, and avoidance reactions. 

1930) in agreement with the Russian findings, 
reported difficulty in establishing conditional avoidance reactions in à 
small group of schizophrenics. Mays (1934), however, found that con- 
ditional GSR's were more resistant to extinction in a group of 20 
schizophrenics than in 16 normal controls, Shipley (1934) studied the 
acquisition and resistance to extinction of conditional GSR's in psychi- 
atric patients. The Ss were 17 schizophrenics, 
mixed neurotics, and 6 norm 
tion at the fastest rate, w 


10 manic depressives, 9 
als. The schizophrenics were found to condi- 
ith normals next, followed by 

sives and neurotics. The neurotic Ss were the most resist 


with the other groups undifferentiated on this measure. 

Pfaffman and Schlosberg (1936 
cedure with schizophrenics, manic 
phrenics tended to be the least re 
of conditioning was too low 


the manic depres- 
ant to extinction, 


) used a knee jerk conditioning pro- 
depressives, and normals. The schizo- 
sponsive group, but the over-all level 
to permit definite conclusions. 
Peters and Murphree (1954) investigated differential С 


SR condition- 
ing to shock in schizophrenics and normals. 


The difference in response 
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to the conditional stimulus and the differential stimulus after training 
was markedly less in the schizophrenics. 

Taylor and Spence (1954) compared the rate of eyeblink condition- 
ing of 74 Ss suffering from neurosis or personality disorder, and a group 
of 42 psychotics, including 31 schizophrenics. The difference in per- 
formance favoring the psychotics just failed to reach the 5% level of 
significance. 

In a large-scale study of GSR conditioning, Howe (1958) used 60 
hospitalized Ss with anxiety states, 60 hospitalized chronic schizophrenics, 
and 60 normal 5s. Taking magnitude of response during extinction as the 
measure of strength of conditioning, Ss with anxiety states showed stronger 
conditioning than schizophrenics or normals. The difference between the 
latter groups was not significant. 

a further study of GSR conditioning in psychiatric patients, 
Stewart et al. (1959) used 70 55, comprising 27 manic depressives in the 
depressed phase, 18 schizophrenics, 15 Ss with personality disorders, and 
10 with anxiety neurosis. The US was shock, and a delayed conditioning 
procedure was used with a 4-sec CS-US interval. GSR’s were classed as 
orienting responses, responses to shock, or anticipatory CR's. It was found 
that Ss with personality disorders conditioned more rapidly than either 
schizophrenic or manic depressive Ss, and that Ss with anxiety neuroses 
tended to condition more rapidly than those with schizophrenia. The 
resistance to extinction of personality-disordered Ss was significantly 
greater than that of the schizophrenics, but no other group differences 
“he frequency of orienting responses to the CS during 
ater in the anxiety neurotics than in either 


In 


were significant. T 
training was significantly gre 
of the psychotic groups. | 

Another finding of interest was that the rate of adaptation to the 
CS before pairing with the US began was significantly slower in the 
anxiety neurotics than in either schizophrenics or manic depressives. 

The contradictions in the reported findings which have been re- 
viewed, and the marked differences in the procedures used in the various 
Studies, make interpretation somewhat hazardous. Nevertheless, the 
a number of conclusions which perhaps need not be 
ace those studies which have used neurotic Ss 
anxiety have added further weight to the 
already established fact that anxious persons tend to acquire aversive 
CR's more rapidly, and to extinguish them more slowly than other 
Broups. Apart from one or two inexplicable discrepancies, it appears that 
schizophrenics tend to be less generally responsive to either mild or strong 


Stimuli, and also to condition rather less readily than normals. The 
ack of responsiveness of schizophrenics is in 


€vidence points to 
too tentative. In the first pl 
Suffering from symptoms of 


conclusion concerning the 1 
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accord with a considerable amount of evidence of a nonconditioning 
nature. There is quite a strong suggestion that schizophrenics tend to be 
deficient in “arousal” or “drive,” and that this manifests itself in a 
somewhat retarded rate of conditioning (Lovibond, 1963a). In view of 
Meehl's (1962) suggestion that schizophrenics may show a deficiency of 
functioning of the positive reinforcement centers, it is interesting to 
speculate whether schizophrenics would not show a more marked 
deficiency in positive reward conditioning than in aversive conditioning. 
Lovibond's (19632) findings with normals who gave schizophreniclike 
responses to the Object Sorting Test (see Section 1I, A, 2, b) suggest that 
this may well be the case, 

The finding that psychiatric patients, other than schizophrenics and 
anxiety neurotics, usually condition less well 11 
reflects a somewhat lowered cortical efficiency w 
patients, but w 


han normals, possibly 
hich is common to all 
hich sometimes does not manifest itself if the patient has 
heightened drive relevant to the response being conditioned. 


B. STUDIES IN THE DEFINITION OF PERSONALITY 


VARIABLES THROUGH 
CONDITIONING 


Very recently, some noteworthy advances 
by Soviet investigators in the development 
types of higher nervous activity. ОГ particular interest is the work of 
Teplov and his associates (Nebylitsyn, 1961, 1963; Nebylitsyn, Rozhdest- 
venskaya and Teplov, 1960; Rozhdestvenskaia, Nebylitsyn, Borisova and 
Ermolaeva-Tomina, 1963; Teplov, 1961). 

The essential difference between the approaches of Soviet and 
Western workers to problems of personality and conditioning can be 
illustrated by a comparison with the work of Eysenck (see Section Il, 
A, 1). Eysenck's initial investigations were made at the level of personality 
functioning. The first task was to determine the major dimensions of 
personality. The next step was to seek, at the neurophysiological level, 
for likely mechanisms of the differences Observed at the personality level. 
Predictions derived from the theory of neurophysiological mechanisms 


were then tested by the procedures of experimental psychology, including 
conditioning. 


The method of the Soviet Workers is 
starting point is at the level of hypotheti 
nisms suggested by Pavlov's theory of brain functioning and general 
behavior. Laboratory procedures, with an emphasis on conditioning, 
are used in an attempt to define the major dimensions with respect to 
which people vary at this level. The next step in the program is to move 
to the level of personality, and there to seek correlates of differences at 


appear to have been made 
of Pavlov's conceptions of 


quite different. For them the 
cal neurophysiological mecha- 
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the neurophysiological level. Teplov's position with respect to the relation 
between personality theory and general behavior theory has been very 
clearly stated: “The problems of the psychology of personality are first 
and foremost problems of general psychology" (Teplov, 1961, p. 21). 
Teplov believes that, before proceeding to questions of typology, it 
is necessary to carry out a much more thorough investigation than 
hitherto of the basic properties of the nervous system—the dimensions 
of higher nervous activity. The starting point of a series of investigations 
was Pavlov's conception of the fundamental properties of nervous 
activity: strength, balance, and mobility. Instead of taking these proper- 
ties as given, Teplov raised the question: In the higher nervous activity 
of human beings, is each of these properties a single, unidimensional 
property, or a group of similar properties? 
as the first property investigated. For this purpose it was 


Strength wa 
necessary to develop a number of laboratory test methods, including 


Most of these techniques are based on the 
ssence of the strength property is 
hod of reinforced extinction of 


conditioning procedures. 
assumption that the physiological e 


maximum nerve cell capacity. The “met 
the photochemical reflex, with and without caffein” is fundamental. The 


essence of this method (Rozhdestvenskaia et al., 1963) is as follows. A 
conditional reflex reduction in absolute visual sensitivity of the dark- 
adapted eye is elaborated to a visual or auditory CS, using a very strong 
light as US. After consolidation of the CR, its magnitude is measured in 
one "experiment." (In Soviet literature it is often necessary, as in this 
case, to read for "experiment" the word "trial") The measure of CR 
magnitude is percentage diminution of sensitivity to the CS relative to 
the background level of sensitivity. Ten reinforcements or CS-US com- 
binations are then given with 9-min intervals, followed by a further 
measure of CR magnitude. In another session the procedure is repeated 
after the administration of 0.2gm caffein. The measure of strength 
derived from this method is the percentage change in magnitude of the 
CR from the first to the second test trial, ie, following “reinforced 
extinction." A diminution of the CR on retest is an indicator of weakness, 
Or in other words the indicator of strength is the amount of resistance to 
"inhibition of reinforcement." The administration of caffein is designed 
to increase the excitability of the nerve cells, and hence to reduce their 
Capacity to withstand prolonged stimulation. | | 
Other techniques are based on the assumption that strength will 
Manifest itself in concentration of excitation and positive induction ef- 
fects under circumstances where weakness will lead to irradiation, and 
diminished, rather than increased, response. 


The research of Rozhdestvenskaia et al. (1963) may be taken as 
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illustrative of work in this area. In this study, 17 techniques were 
employed with each of 40 Ss to determine nervous system strength. Each 
of these methods had been individually studied and the theoretical 
validation (construct validity) of each was generally accepted. In addition, 
four measures of magnitude and rate of extinction of vascular orientation 
responses were included. The 17 strength measures included four 
measures of reinforced extinction of the photochemical CR (visual and 
auditory CS, with and without caffein) and four variants of the “induc- 
tion method” (measures of effects of a punctuate extraneous stimulus on 
sensitivity to a second punctuate stimulus). Four other measures were 
derived from ergographic recordings under conditions of diminished 
nerve cell capacity, achieved by the administration of caffein, and 
muscular fatigue. 

Centroid factor analysis of rho rank order correlations revealed a 
clear strength factor. Thirteen of the 17 strength measures showed high 
loadings with respect to this factor. A second factor was tentatively 
identified as “nervous system stability.” The only measures showing high 
loadings with respect to Factor II were the four vascular orientation 
reflex indices. A third factor, loading only “photochemical reflex to 
acoustic stimuli with and without caffein,” was interpreted as a special 
intensity effect of the auditory CS. 

Nebylitsyn (1961) has published an account of a similar factor 
analytic study in which the principal strength measure was inhibition 
with reinforcement of the conditioned alpha blocking response of the 
EEG. 

The results of studies such as these have led Teplov and his 
associates to conclude that strength is a unidimensional property of the 
nervous system. The position is not nearly as clear with respect to the 
properties of mobility and balance. According to Teplov (1961, p. 36), 
theoretical considerations and factual evidence suggest that the property 
of mobility “includes all the time characteristics of the work of the 
nervous system, and all aspects of this work in which classification by 
speed can be applied.” Possible indices of mobility suggested by Teplov 
include (a) the speed of development and arrest of nervous processes, 
and (b) the speed of replacement of excitation by inhibition, and of 
inhibition by excitation. According to Nebylitsyn (1962), indices of 
mobility have not been found to correlate highly, and the conviction is 
growing that it is not a unidimensional property. 

Teplov (1961) takes the view that balance is a secondary, derivative 
property of the nervous system, “for the simple reason that here it is a 
question of balance in strength or mobility .... It is now hardly a matter 
of doubt that there can be disequilibrium in the mobility of the nervous 
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processes" (p. 35). Nevertheless, Nebylitsyn (1963) has carried out an 
investigation based on the hypothesis that balance is a unitary property. 
In this study, seven indicators of balance were used with 22 student 
subjects. On the assumption that "one must give preference to the 
'conditional reflex (involuntary) indicators of the relationship among 
nervous processes," two reference indicators were defined. These were 
(a) the rate of extinction of the conditional-reflex depression of the alpha 
rhythm elaborated in the course of "activating" reinforcement (see 
Section III, D, 4) and (b) the rate of elaboration of differentiation to an 
acoustic stimulus in alpha depression conditioning. The other indicators 
were a strength of CR measure, a measure of response to the "uncondi- 
tional" light stimulus, and measures of strength and rate of extinction of 
the initial orienting response (alpha depression) to the acoustic CS. 

The results revealed. that all of the intercorrelations between 
measures, except two, were significant at the 5% level or better. It was 
concluded that the results supported the hypothesis that а common 
property of the nervous system, that of equilibrium, underlay the various 
response measures. Dominance of excitation over inhibition "causes 
greater resistance to extinction of alpha rhythm depression, orientation, 
and conditional reactions, retardation of differentiation, and longer 
depression of the alpha rhythm upon application of orienting, conditional 
and reinforcing stimuli" (Nebylitsyn, 1963, p. 27). When excitation and 
inhibition are in equilibrium, or inverse relationship, the opposite 
picture is sometimes observed. 

Presumably, as indicators of equilibrium become more firmly based, 
it will be possible to begin to answer the question of the extent to which 
strength and equilibrium, or balance, are independent dimensions of 
nervous activity. Certainly the orienting reflex activity measures used by 
Rozhdestvenskaia et al. (1963), which were similar to those of Nebylitsyn 
(1963), were virtually independent of the main strength measures. 

The mobility concept is likely to continue to be recalcitrant as there 
seems to be considerable overlap between "Teplov's suggested mobility 
indicators, which were referred to earlier, and the present measures of 


€quilibrium. 

A further question ma 
Ship between "strength," 
intense and prolonged stimul 
Of as “strength of excitation” 


y be raised at this point. What is the relation- 
measured in terms of capacity to withstand 
ation, and what would ordinarily be thought 
and which would be measured in terms of 


amplitude of response? Since it has been shown in a number of studies 
that strength and sensitivity (in terms of thresholds) are inversely related, 
1t might be expected that strength in these two senses would be inversely 
related also. In the study of Rozhdestvenskaia et al. (1963) there were two 
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amplitude measures: the mean levels of the vascular orienting reactions 
to the auditory stimulus and to the visual stimulus. Neither of these 
measures correlated at all highly with any of the measures of capacity to 
withstand adverse conditions of stimulation. However, the auditory and 
visual vascular amplitude measures correlated only .29 with each other. 


This low correlation suggests low reliability in one or both of these 
measures. 


It is not surprising, nevertheless, 


that the two vascular magnitude 
measures correlated reasonably 


well with their respective. extinction 
measures, since the two types of measures were not really independent. 
That is to say, if, as in this case, the measure of extinction is the level of 
response after a given number of applications of the stimulus, or the 
number of stimulus presentations necessary to reach zero response, the 
extinction measures of Ss with initially high levels of response will be 
relatively low, unless they show a very rapid decrease in response with 
continued stimulus presentations. Usually this will not occur to any 
marked degree, and the result will be an inverse relationship between 
initial response level and “rate of extinction.” In Nebylitsyn’s case, such 
a result makes possible an interpretation in terms of predominance of 
excitation. If, however, the slope of the extinction curve is used as а 
measure of rate of extinction, a positive relationship may be obtained 
between initial magnitude of the orienting reaction and its rate of extinc- 
tion (Lovibond, 19632). Clearly in this Сазе one can no longer talk in 
terms of predominance of excitation or inhibition. It should be noted 
that Nebylitsyn is not alone in using the type of extinction measure he 
favors, since this measure is conventional in the Western literature. 
Despite contradictions and queries such as those mentioned above, 
one cannot but be impressed by the work of Teplov and his associates. 
The character of this work has been stated modestly by Teplov: “We are 
striving to obtain such information about the individual differences in 
people as will form some System. We are attempting to put forward 
hypotheses which can be verified experimentally, We have chosen the 
path from the physiology of higher nervous activity to psychology. We do 
not consider that this is the only possible path, but we are firmly con- 
vinced that it is one of the possible paths.” To this might be added the 
observation that the path being followed js potentially a very rewarding 
one, and one hopes that as more details of the work become available, 
Western psychologists will be induced to join in its exploration. 


~ 
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III. Problems in the Investigation of Personality 
Dynamics by Conditioning Procedures 
A. NONASSOCIATIVE FACTORS AND THE MEASUREMENT OF CONDITION- 
ING 
It has long been known that not all changes in the response measures 
which occur during conditioning represent "true conditioning," or 


"experimental extinction," in the sense of being a direct function of 


CS-US pairing and unpairing. It is customary to describe as “nonassocia- 
influences on response measures which do not result 
from the association (or subsequent dissociation) of CS and US. Non- 
associative factors giving rise to response increment have been referred 
10 as pseudoconditioning and reflex sensitization. Nonassociative factors 
producing response decrement have been termed habituation and adapta- 
tion. All of these factors, but particularly those leading to response 
increment, tend to confound measures of CR acquisition. 

It has become conventional to refer to the control procedure of 
presenting the US only during the conditioning phase as pseudocon- 
ditioning control, whereas presentation of the CS and US separately is 
termed sensitization control. 10 is unlikely, however, that different 
mechanisms are responsible for pseudoconditioning and sensitization, 
and it is convenient to use the term pseudoconditioning to refer to non- 
associative response increment in general. The precise nature of pseudo- 
conditioning effects is still a matter of controversy (Martin, 1962). 
at pseudoconditioning results from a reduction of 
avis, Buchwald, and Frankmann, 1955). 
Wickens and Wickens (1942) have argued that pseudoconditioning is a 
process essentially the same as that of conditioning proper. These authors 
emphasize the associative elements which must necessarily be present in 
any pseudoconditioning experiment, е.р., the similarity of the situation 
in which the CS and US are presented. Recent electrophysiological studies 
(Voronin and Sokolov, 1960) suggest that nonassociative response incre- 
ment is often a manifestation of the return of the generalized orienting 
reflex when reinforcement is introduced. Autonomic responses, such as 
GSR, heart rate change, and vasomotor responses, tend to occur as part 
of a generalized orienting response to any sudden stimulus change. Con- 
sequently, when the CS is presented alone during habituation or adapta- 
tion trials, the GSR, for example, occurs as an 
Repeated presentation of the CS usually results in a marked diminution 
in the amplitude of the response to the CS. Voronin and Sokolov (1960) 
have produced evidence that this adaptation of the orienting reflex 
results from the development of a process of inhibition, and not merely 


tive factors” any 


One view is th 
peripheral response thresholds (D 


*unconditional" reaction. 
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from a decrease in arousal. Presentation of any new stimulus after adapta- 
tion disinhibits the orienting reflex, and produces a marked increase x 
response to the CS. If, however, the new stimulus is an aversive US; pu 
as shock, the increased response to the CS is of greater amplitude, Е 
is more prolonged, indicating an increase in arousal as well as uror 
tion of the orienting reflex. As conditioning proceeds with further pate 

pairings, the generalized orienting reflex gives way to more speci e 
arousal (Voronin and Sokolov, 1960). If the response being иши: 
is an indicator of generalized arousal, there will be a resultant tendency 
toward response decrement during this Stage. Presentation of isolated test 
stimuli, in a context of repeated CS-US pairings, will again tend to dis- 


hih; А CMS : Аа 0% 
inhibit the generalized orienting response, as will the change from 100% 
reinforcement to an extinction series. 


aM ч associative 
It has often been assumed that pseudoconditioning, or nonassociat 


response increment, is limited to conditioning involving aversive stimula- 
tion (Kimble, 1961). Recent work has suggested that this type of response 
increment may also occur with “neutral” reinforcement, as in condition- 
ing of the alpha blocking response, and with positive reinforcement, as 
in salivary conditioning (see Section HI, D). 

The investigation of personali 


à "NA ls 
ty dynamics by conditioning methoc 
would benefit greatly from detaile 


d investigations of the mechanism Г 
н н ртт М > : he 
nonassociative factors in conditioning. It is unlikely, however, that t 


5 ү ERR ; of 
operation of these factors can be elucidated fully within the confines О 
the typical "acute," or very short-term cond 


while, on the reasonable assumption that 
laws which differ from those of conditioni 
personality and conditioning must seek wa 
tive influences from their response measu 
this aim are considered in the discussio 
cedures in Section ПІ, D, where it is 
delayed and differential conditioning off 
lem of obtaining uncontaminated condi 


litioning experiment. Mean- 
pseudoconditioning follows 
ng proper, investigators OF 
ys of eliminating nonassocia- 
tes. Possible ways of achieving 
n of specific conditioning pro- 
Suggested that the methods of 
er partial solutions to the prob- 
tioning measures, 
B. THE INFLUENCE or CENTRAL “Ser” FACTORS 
Although classical conditioning is concerned wi 
of involuntary responses, performance during cond 
may be markedly influenced by complex centra 
usually described in terms of “set,” "expectancy, 
influence of such factors is perhaps most Striking w 
conditioned may be performed both voluntarily 
eyeblink conditioning, for example, sets induced 
your reactions take care of themselves” or « 


th the modification 
itioning experiments 
l factors which are 
' or “attitude.” The 
hen the response to be 
and involuntarily. In 
by instructions to “let 


be sure not to wink before you 
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feel a puff," can produce very clear-cut facilitatory or inhibitory effects. 

In an experiment on the effect of instructions on differential eye- 
blink conditioning, Hilgard and Humphreys (1938) used both supporting 
and antagonistic instructions. They found that responses increased in 
frequency and amplitude and decreased in latency—when they were 
ordered—from voluntary restraint, through uninstructed, to voluntary 
facilitation. It should be emphasized that these effects occur in relation to 
involuntary responses, i.e, responses outside the range of latencies 
appropriate to voluntary self-induced responses. Furthermore, voluntary 
restraint does not eliminate conditional responding. 

When eyeblink conditioning is used in personality study, it is usual 
to give facilitatory instructions, but it is not a simple matter to determine 
the extent to which the instructional set is counteracted by self-induced 
sets. Razran (1936) suggests that, in salivary conditioning, attitudinal 
factors are responsible for greatly increased variability of response. 
Stability of performance is enhanced if Ss have no knowledge of con- 
can be kept unaware of the nature of the experiment 
is task. 

*if the inconsistency and variability 
ence of the subject's 


ditioning, or if they 
by engaging them in some extraneou 
Al-Issa (1963) has assumed that 


of adult human conditioning is due to the influ 
verbal and attitudinal activities, then by the mere absorption of the 


Subject in performing some task, these interferences may be ruled out 
and conditioning may become regular and reliable. These attitudinal 
interferences are reduced because the absorbing activity takes the sub- 
ject’s attention from the CS and the US and thus he will have less 
voluntary control over the activities related to conditioning.” Accordingly, 
in an eyeblink conditioning experiment conducted by Al-Issa (1963), one 
Broup had the task presented to them as a reaction-time study. Subjects 
in this group were required to respond to the CS as rapidly as possible 


by pressing a Morse key. Key pressing was found to result in significantly 


higher levels of responding. 
In an early study of the effect of instructions on the GSR, Cook and 


Harris (1937) gave their Ss a series of adaptation trials with the stimulus 
to be conditioned (light). No mention was made of the fact that an 
electric shock was to be used in the experiment. One group of Ss was 
then told that electric shock would follow the light on subsequent trials. 
A further test with the CS alone showed a greatly increased level of 
The subsequent magnitude of response did not 
irings. Cook and Harris further in- 
Vestigated the effect of instructions on extinction. They found that when 
Ss were told that the light would no longer be followed by shock, the GSR 
to the light was very rapidly eliminated in comparison with controls. 


response in this group. 
increase further with 30 CS-US pa 
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The effect of S's expectations on his reactivity is illustrated in a 
study by Lovibond (19632). Negative reinforcement of the GSR (shock) 
was used in one session, and positive reinforcement (nude pictures) was 
used in another. The test sessions were approximately ] week apart, and 


i fad ch 
order of sessions was counterbalanced. Before the beginning of ea 


М 5 se LO 
session Ss knew whether shock was to be used or not. The response t 
the first adaptation trial of the CS wa 


as approximately 50% higher in the 

shock session than in the nonshock sessions. " 
In a recent study of the effect of instructions on GSR conditioning, 
Silverman (1960) showed that the CS-US interval interacts with the effects 
of instructions. Wickens, Allen, and Hill (1963) found that instructions 
to the effect that no more shocks are to be presented do not immediately 
eliminate GSR's to the CS, but rather result in a faster rate of extinction. 
It is clear that the course of conditioning, differentiation, and extinc- 
tion can be influenced by central processes arising partly out of ne 
textual stimuli, including instructions, and partly out of the varied 
attitudes and expectations which Ss bring to the laboratory. It seems 
probable that most of these influences are mediated by language, and 
hence it is likely that their laws of operation differ from those governing 
conditioning of simple responses. If this is the case, it follows that these 
factors must serve to increase the error variance in conditioning measures. 
Further, since these central factors undoubtedly play a more аро. 
onditioning procedures than in others, the instrusion О 
these influences will tend to vitiate cross-technique comparisons of 
conditioning. Nevertheless, it is easy to exaggerate the influence of 
complex central factors, Most of the dramatic effects obtained with GSR 
conditioning (such as rapid extinction when no shock instructions аге 
given, and increased response to the CS when Ss are merely told of 
impending shock after the CS) act on nonassociative components in the 
response measures. As we gain more understanding and control of non- 


associative factors in conditioning, our understanding and control of 
central set factors should likewise increase, 


C. GENERAL FACTOR or CoNprrioN An rry 


A problem which is frequently 
ditioning and personality is the e 
conditionability, or е 


raised in ге] 
Xtent to whi 
азе of conditiona] reflex 
performance in the various types of conditionin 
employed. It is argued that theories of Personality, such as that of 
Eysenck, assume the existence of à strong genera] factor, bur iPods 
has not been shown to exist. In fact, measures of conditioning lave seldom 
been found to correlate at all highly. At first sight it would appear to be 


ation to studies of con- 
ch a general factor of 
acquisition, underlies 
8 situations commonly 
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à simple matter to undertake a factor analytic study of conditioning, with 
à variety of techniques, to decide the issue of the existence and strength 
of a general factor. There are, however, many diffculties in the way of a 
definitive general factor study, and some of the more important ones will 
Shortly be taken up. Before turning to these problems, the possible 
existence of group and specific factors of conditionability will be con- 


sidered, 


1. Functionally Specific Arousal Systems and the Possibility of 


Group and Specific Factors 


In a preceding section (II, A, 2, a), the probable existence of 


functionally specific arousal systems, in addition to a general arousal 
system, has been discussed. Whilst a degree of intraindividual consistency 
in the reactivity of these systems is to be expected, such consistency is 


likely to be far from perfect. To the exent that functionally specific 


arousal systems are of importance in the establishment of conditional 


linkages, intraindividual variability in the reactivity of these systems will 
reduce the strength of any general factor of conditionability. 

It seems then that it is reasonable to expect a general factor of 
conditionability, and, in addition, specific and/or group factors. It is 
Possible, if not probable, that there exists group factors associated with 
the broad defense-arousal and positive reward systems, in other words, 
factors related to aversive and appetitive conditioning. It is clear, how- 
ever, that differentiations exist within these broad systems (attention 
has already been drawn to the division within the feeding center). The 
Work of Olds (1956) is of interest here. Using the technique of self- 
Stimulation through implanted electrodes, Olds mapped out areas 
centering on the hypothalamus within which mild electrical stimulation 
has either positive or negative reward value. In later experiments, Olds 
(1958) investigated the possibility of differentiation within the positive 
reward areas. Response rates with given electrode placements were 

and low hunger and sex drive. 


determined under conditions of high à 
The results pointed to the existence of a localized hunger-reward system 


differentiated from a local sexual reward system with a reciprocally in- 
hibitory relationship between the two. It is possible that each arousal- 
ill be found to be associated with a specific factor of 


reward system w 
Conditionability. 

There are many reasons that the low inter 5 
been found between different types of conditioning measures do not 
Constitute decisive evidence against the existence of an important general 
factor of conditionability (Eysenck, 1962). Among the many influences 


rcorrelations which have 
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which, in addition to specific conditionability factors, will tend to atten- 
uate observed correlations, may be listed the following. 


a. Variability in Peripheral Response Mechanism Sensitivity. Since 


the development of a central conditional linkage can be measured only 


through the medium of some peripheral response mechanism, such as 


blinking of the eyes or activity of the sweat glands, it follows that any 
intraindividual variability in the sensitivity of the peripheral mechanisms, 
which is not highly correlated with variability in conditionability, will 
reduce the correlations between conditioning measures. 

b. Nonassociative Contamination of Conditioning Measures. To the 
extent that nonassociative factors no 
ability enter differentially into various types of conditioning measures, 
the observed relationship between the measures will be spuriously low. 
For example, different degrees of such confounding in negatively re- 
inforced and positively reinforced GSR's almost certainly reduced the 
correlation obtained by Lovibond (1963b) between these two types of 
conditioning. 


t highly correlated with condition- 
gny 


c. Central Set Influences in Conditioning Measures. In the discussion 
of central “set” and “attitude” factors in relation to conditioning, it was 
pointed out that different types of conditioning are susceptible to these 
influences to different degrees. Since set and attitude are unlikely to co- 
vary consistently with conditionability, these factors represent another 
possible source of attentuation of correlations between different types 
of conditioning measures. 

d. Procedural Variations. Among the many possible procedural 
dimensions with respect to which conditioning experiments can and do 
vary, the following appear to be some of the most important: (i) Strength 
of US; (ii) CS-US interval; (iii) length of intertrial interval; (iv) nature 
of the intertrial interval (silent, filled, or partially filled (see Section Ш, 
D, 2); (v) percentage of reinforcement. 

The influence of variations in the above 
from being fully determined for any one type of conditioning. Sufficient 
is known of their operation, however, to suggest that, at least to some 
degree, optimal conditions are likely to vary with the type of condition- 
ing. For example, intertrial intervals of around 90 sec, which appear not 
to be detrimental to eyeblink conditioning, are too short for optimal 
GSR conditioning (Prokasy, Hall, and Fawcett, 1962). Furthermore, the 
interactions between procedural variations are, as might be expected, 
far from simple. For example, although a CS-US interval of about 
450 msec has been shown to produce maximum performance in the con- 
ditioning of finger withdrawal (Fitzwater and Thrush, 1956; Spooner 
and Kellog, 1947), eyeblink conditioning (Kimble, 1947; Kimble, Mann 


aspects of procedure are far 
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and Dufort, 1955; McAllister, 1953), and GSR conditioning (Moeller, 
1954; White and Schlosberg, 1952), a recent study by Kimmel and Penny- 
packer (1963) has shown that differential conditioning of the GSR in- 
creases as a function of CS-US interval up to the maximum value studied 
(2 sec). Caution is in order, however, in interpreting GSR studies such 
as those of White and Schlosberg (1952), and Moeller (1954), because of 
the probability that conditional and orienting GSR's were confounded in 
the response measures. 

Pavlov (1927) apparently found no significant differences in ease of 
Salivary conditioning between CS-US intervals up to 5 sec, as CR's formed 


with these intervals were all termed “simultaneous” CR’s. No relevant 


data on human salivary conditioning is avaliable, and the issue of whether 


or not the optimal CS-US interval is the same for all types of condition- 
Ing remains open. 


For the purposes of investigating à general factor of condition- 


ability, it would appear desirable to seek optimal procedures for each 
technique, rather than to attempt a purely mechanical standardization 
Of procedures, Until such optima are firmly established, the procedures 
adopted with different techniques are likely to fall short of optimal 


Conditions by different degrees. Since this is unlikely to affect all indi- 


Viduals equally, intercorrelations between measures will be correspond- 


ingly reduced. 
2. Other Parameters of Conditioning Performance 


The concept of “conditionability” refers to the ease of acquisition of 


Conditional reflexes, i.e., the development of excitatory linkages. There 
performance which are of 


are, however, other aspects of conditioning 

relevance for personality functioning. Diamond et al. (1963) have argued 
Cogently for their view that inhibitory processes enter into every be- 
havioral act, and must be given equal consideration with excitatory 
Processes if we are to understand any aspect of psychological function- 
ing. The conditional reflex was developed by Pavlov as a methodological 
technique for studying the interaction of excitation and inhibition in 
higher nervous activity. Manifestations of inhibitory processes in condi- 
tioning were considered to occur in the phenomena of differentiation, 
the acquired capacity of a stimulus to suppress or delay a response to a 
ction of conditional reflexes, both with 
апа without reinforcement. These aspects of conditional reflex activity 
have received attention only occasionally in studies of conditioning and 
Personality. An exception is to be found in the work of Solymon and 
Beach (1961), who, in their studies of memory defect in the aged, have 
Investigated the conditioning parameters of rate of extinction, differen- 


1 "o à Н 
Ong-acting stimulus, and extin 
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tiation, inhibition of delay, and rate of extinction of the orienting reflex, 
in addition to rate of acquisition of a conditional reflex. 


It would seem that a concept of conditionability that deals only 
with excitatory processes ignores aspects of conditioning performance 
which, in the long run, could possibly account for more of the variance 
in personality functioning than excitatory measures such as rate of CR 
acquisition. 


D. PROBLEMS AND POSSIBILITIES OF 
‘TECHNIQUES 


1. Eyeblink 


Conditioning of the eyeblink to 
been widely used in personality 
method. It is stand 


SPECIFIC CONDITIONING 


à puff of air is a technique which has 
study, and it is the most standardized 
ard practice to use of CS-US interval of 450-500 msec, 
and an air puff of approximately 1 psi. Also a pure tone is commonly 
used as the CS, mainly because light CS's have been shown to elicit UR's 
under some conditions. The transducer may be mechanical (microtorque 
potentiometer) or photoelectric. Both systems are highly developed and 
generally satisfactory. The different types of response which occur in eye 
blink conditioning have been studied in detail. The most important 
criterion for distinguishing various types of response is latency, but form 
is also useful as a secondary criterion. The usually accepted categories 
of response and their latency ranges, with a 500-msec CS-US interval, 
are reflex blink to light (alpha response), 50-110 msec; beta response 
(reflex blink to light in dark-adapted eye), 120-240 msec; conditional 
response, 250-500 msec; voluntary response, 200-500 msec; unconditional 
response to puff, 50-100 msec after US. 


Whilst UR's to the CS can readily be distinguished from CR's in 
terms of latency, the differentiation of voluntary responses from CR's 
poses a problem. The accepted criterion of a voluntary response is that of 
possessing characteristics similar to responses given by Ss who are in- 
structed to blink to the CS in order to avoid the puff, ог who report that 
they responded voluntarily in order to avoid the puff. Since the majority 
of these voluntary responses have latencies from 200 to 3 
and Ross (1959) have proposed that o 
terval 300-500 msec after CS onset should be termed CR 


acteristically different 
of form is difficult to 


е criterion of latency. 
Random blinking poses another problem for ey 


and some experimenters reject Ss whose natural b] 


from that of a CR, but in many cases the criterion 
apply, and there has been a tendency to rely on th 
a > 


eblink conditioning, 
ink rate is above a 
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predetermined level. Since a high blinking rate has often been considered 
a sign of anxiety, it is clear that the investigator of relations between 
personality and conditioning faces something of a dilemma in reaching 
a decision concerning whether or not to reject Ss. A similar problem 
arises in connection with Ss who respond voluntarily to the CS before 
training. It may happen that such voluntary responses are related to the 
personality parameter under study. For example, Willet (1963), in his 
study of the effect of high and low drive on eyeblink conditioning, re- 
jected any S whose natural blink rate was faster than one blink every 
500 msec, He reports that he rejected 7 of 62 Ss in the high drive group 
e or because they responded to the CS signals 


"through apparatus failur 
' Of the 60 low-drive Ss, on the other hand, 


during the control trials.’ 


"five were rejected through apparatus failure." 
The personality psychologist's dilemma will be solved only by more 


fundamental work on the differentiation of associative and nonassociative 


factors in conditioning, and the relation of these factors to dimensions of 
personality. It will probably be necessary to recognize, however, that there 
are some Ss from whom adequate measures of eyeblink conditioning can- 
not be obtained, just as some Ss have an insufficiently defined alpha block- 
ing response to permit EEG conditioning, and some Ss produce insuf- 
ficient GSR’s to make GSR conditioning possible. 


2. Galvanic Skin Response 

The galvanic skin response (GSR) rivals the eyeblink in popularity 
h human Ss. The usual finding is that the 
1 with an electric shock as the US, and it 
auditory stimulus in place of the 
hat it varies continuously over a 


as a conditioning procedure wit 
GSR can readily be conditionec 
can also be conditioned with a loud 


shock. An advantage of the GSR is t 
Wide magnitude range, and hence lends itself to quantification. Never- 


theless, there have been endless debates concerning the appropriate unit 
of measurement of the GSR. Investigators have laid down criteria for 
Suitable measures, such as independence of basal resistance and normal- 
ity of distribution. Units commonly employed have included percentage 
change in conductance, the square root of con- 
ductance change, and log conductance change. Various ratio measures 
have also been proposed. A large-scale study by Eysenck (1956) showed 
that the extent to which various measures met the usual criteria varied 
With the experimental conditions, and that none had any marked ad- 


vantage over the others. 
In the author's view, 

Measurement cannot be decided in the 

Personality study offers the possibility of provid 


change in resistance, 


the problem of the most appropriate unit of 
absence of an external criterion. 
ling such a criterion. If, 
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for example, it were possible, on the basis of personality theory, to define 
groups with widely differing conditioning performance, these groups 
could be used as external criteria. That is to say, the GSR measure which 
most effectively and consistently separated samples from the groups in 
terms of conditioning would be the unit of choice. Until some such basis 
for a rational choice exists, it would seem reasonable to use the unit of 
measurement most convenient for one’s purpose. 

GSR conditioning is the least standardized of all the conditioning 


procedures. Whereas, in the case of eyeblink conditioning, the favored 


CS-US interval is almost always around 450-500 msec, this interval may 
vary from 450 msec to 


20 sec in GSR conditioning. The characteristics 
of the US also vary considerably from experiment to experiment. Thus 
the electric shock US may differ in current type (for example, alternating 
or direct current, condenser discharge), electrode type, electrode place- 
ment, intensity, and duration. If the current type conforms to a standard 
pattern, for example 60 cycles ac, it can be specified adequately in terms 
of amperage if the electrode size and position are known. In many cases, 
however, the electric shock used to condition the GSR is unspecifiable. 

The range of intertrial intervals commonly used in GSR condition- 
ing is not great. The average is about 45-60 sec, but the interval may be 
as short as 15-25 sec (Prokasy, Hall, and Fawcett, 1962). The importance 
of the intertrial interval is emphasized by the failure of Prokasy et al- 
(1962) to find a difference between conditioning and pseudoconditioning 
groups with the very short interval of 15-25 sec, whereas separation of 
conditioning and pseudoconditioning groups was achieved with longer 
intertrial intervals in a later experiment (Wickens, Allen, and Hill, 1963). 

What happens during the intertrial interval is another important 
source of difference between GSR experiments. Тһе interval may be 
either "silent" or "filled." The term "silent" simply refers to the proce 
dure, usual in other forms of conditioning, in which no other stimuli are 
presented between punctuate trials of CS, CS-US, or US. “Filled” (or 
"partially filled") intertrial intervals, оп the other hand, are those in 
which a series of stimuli similar to the CS is presented between trials of 
CS, CS-US, or US. In the most common experiment of this type, the CS 
is a word, and other words are used to fill the intertrial interval (Becker, 
1960; Becker and Matteson, 1961; Vogel, 1960, 1962). It can be seen that 


the "filled interval" experiment is Simply one in which multiple differ- 
ential stimuli are presented. 


The question of filled versus unfilled 
portance in the light of recent concern over 
from nonassociative factors in GSR condition 
that the GSR may be elicited by 


intervals assumes some im- 
the separation of associative 
ing. It has long been known 
any novel stimulus, but electrophysio- 
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logical studies of the mechanisms of arousal and the orientation reflex 
have given this fact a new significance. The usual practice in GSR con- 
tioning is to present the stimulus to be conditioned a number of times 
prior to conditioning in order to adapt or habituate the GSR to this 
stimulus. After the adaptation trials, the conditioning phase is begun 
with several CS-US pairings, followed by the first test stimulus (CS 
presented alone). Any increase in the GSR to the test stimulus, beyond 
the level of response to that stimulus after adaptation, is attributed to 
conditioning. However, recent work by Sokolov (1960) and others has 
emphasized the extent to which the habituated orienting reflex, of which 
the GSR is a manifestation, is re-evoked by any relatively sudden change 
in the stimulus input. Evidence that the components of the orienting 
reflex can be habituated in different patterns, depending on the sensory 
input, suggests the operation of selective inhibitory mechanisms. To 
account for these and other findings, Sokoloy (1960) has proposed the 
concept of “neuronal model.” The neuronal model is a cortical repre- 
sentation, by a pattern of excitation, of recent past experience, against 
which incoming stimulation is compared. As long as the incoming stim- 
ulation corresponds to the model, it is checked by inhibition at the level 
of the brain stem collaterals, but in the case of a lack of concordance, 
afferent impulses are fed into the arousal system, which then discharges 
into cortical centers, increasing capacity for discrimination and causing 
the formation of a new model more consonant with the new stimulus 
input. : 
Grings (1960) has proposed a related concept which. he terms the 
perceptual disparity reaction. Applied to the GSR. conditioning situa- 
tion, these concepts suggest, first, that the first reinforcement following 
an adaptation series will release the orienting reflex from inhibition, 
and this release will be reflected in subsequent response measures. In 
addition, moreover, any change in the pattern of stimulation, including 
the omission of a stimulus, will tend to disinhibit the orienting reflex. 
onal test trials of CS alone, in a series of CS-US 
Pairings, will evoke an orientation reflex which will be confounded with 
2. are massed at the end of a condition- 


àny CR which occurs. If test trials c кеш d 1 

ing sequence in the [orm of an extinction series, the initial trial will tend 

to oat ie a strong orientation reflex, followed by the development of 
as g 


habituation on subsequent trials. Any conditional GSR's, and their sub- 
sequent experimental extinction, will be confounded with the orientation 


GSR’s and their habituation. | 

It would be expected that the strength of the re-establishment of the 
Tienting reflex by a change in the stimulus conditions will depend on 
the extent of EM disparity between the original and the new stimulus 


This means that occasi 
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conditions, and the suddenness of the change. Consider, for example, the 
experiment of Prokasy et al. (1962) which compared the GSR's, in an 
extinction series, of groups given 

(a) tone-shock pairings (regular conditioning), 

(b) shock-tone pairings (backward conditioning), 

(c) random shocks and random tones (sensitization), 

(d) shocks only (pseudoconditioning), and 

(e) tones only. 

The ordering of the groups from highest to lowest GSR on the first 
extinction trial was shock only; backward conditioning; regular condi- 
tioning; random shocks and random tones; and tones only. This is pre- 
cisely the ordering that judges give if they are asked to rank the 
groups in terms of decreasing perceptual disparity between training and 
extinction test trials. In the experiment of Prokasy et al., the response 
levels of the shock-only and backward-conditioning groups decreased 
rapidly compared with those of the true conditioning group. The re- 
sponse level of the random group tended to remain relatively high, 
possibly as a result of trace conditioning with the very short intervals 
used. 

Stewart, Stern, Winokur, and Fredman (1961) have argued recently 
that “so far work on GSR conditioning has dealt with the adaptation and 
recovery of unconditioned responses rather than conditioning of re- 
sponses.” The argument that no unequivocal evidence of GSR condition- 
ing had previously been produced could be sustained only by ignoring the 
results of long-delay GSR conditioning experiments (Rodnick, 1937, 
Switzer, 1934), which measured anticipatory responses similar to those 
advocated by Stern et al., and also the results of differential GSR. con- 
ditioning experiments. The latter include experiments using a single 
differential stimulus (Kimmel and Pennypacker, 1963; Lovibond, 19632), 
and those using multiple differential stimuli ("filled" intertrial intervals) 
(Becker, 1960; Becker and Matteson, 1961; Vogel, 1960, 1962). 

Further unequivocal evidence of GSR conditioning has been ob- 
tained by experiments in which an extended extinction test series has 
demonstrated a significantly higher level of response in a forward con- 
ditioning group by comparison with a backward conditioning group 
(Prokasy et al., 1962). The backward conditioning procedure provides an 
acceptable control, since the perceptual disparity between training and 
extinction is greater with this procedure than with forward conditioning, 

From the foregoing analysis it would appear that incontrovertible 
evidence of GSR conditioning (other than differential conditioning) can 
most readily be obtained if the stimulus relations remain constant 
throughout the experiment, or if the changes in stimulus pattern are 
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below the level of discriminability of the "neuronal model" or orienting 
reflex comparator mechanism. 

In practice, experiments of the first type are limited to those employ- 
ing the delayed conditioning method in which anticipatory CR's are 
distinguished in terms of latency. It is necessary, of course, to employ a 
sufficiently long CS-US interval to avoid masking of anticipatory CR's 
by orienting GSR's. Conditioning with delays of 10 sec or longer presents 
no great difficulty, and contradicts the belief that GSR conditioning can 
be obtained only with a CS-US interval of 450—500 msec. 

It seems likely that the function relating CS-US interval and strength 
of conditioning rises to a new peak beyond intervals of 4-5 sec, but 
curiously enough nobody has investigated this possibility. In considering 
the problem of CS-US interval, however, it should be borne in mind that 
measures of conditioning derived from short intervals will necessarily 
be confounded with orienting GSR's if silent intertrial intervals are used. 
This suggests that examination of the CS-US interval function can best 
be carried out with experiments which reduce the suddenness of changes 
in the stimulus pattern below the point where orienting GSR's will be 
evoked. This implies filling the intertrial interval with multiple differ- 
ential stimuli to produce a "constancy in change." Current work in the 
Adelaide laboratories suggests that under these circumstances orienting 
GSR's will be almost entirely suppressed in many Ss. The fact that CR's 
appear to form readily enough against this background suggests that 
generalized arousal may not be necessary for the establishment of condi- 
tional linkages, and that the inhibition which suppresses the orienting 
reflex may have little or no effect on the elicitation of the GSR by specific 
arousal mechanisms. It must be stressed, however, that there is as yet 
insufficient evidence to permit firm conclusions concerning these possi- 
bilities. 

From the point of view of personality study, the advantage of long- 
delay and filled-interval GSR conditioning procedures is that they give 
relatively pure measures of conditioning, and make possible a measure 
of rate of acquisition of the CR. Although the strength of orienting CR's 
is likely to be positively correlated with rate of conditioning (Lovibond, 
1963a; Martin, 1963), confounding of the two measures is clearly un- 
desirable if one wants to test theories relating personality and condition- 
ing, or to investigate the generality of conditionability. 

Positive reward conditioning of the GSR has been investigated by 
Lovibond (1963a,b) using still pictures of nude females as the US. Using 
à delayed differential conditioning procedure with silent intervals, it was 
found that the intrusion of orienting GSR's was much less pronounced 
than in the case of shock conditioning. Without distingushing the la- 
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tencies of responses, a rising curve of response to the CS was obtained, 
together with a falling curve of response to the differential stimulus 
(DS). This technique promises to be a useful supplement to salivary 
conditioning in the investigation of positive reward conditioning. A 
"filled interval" moving film version is at present being investigated. 
In this form of the technique, short sequences of moving pictures of 
nudes are separated by intervals of neutral shots of outdoor scenery. 
Both visual and auditory long-delay conditional and differential stimuli 
are being used. Substitution of a shock US in this procedure should make 
possible a more satisfactory comparison of positive and negative condi- 
tioning of the GSR, since the usual confounding of orienting and con- 
ditional GSR's with aversive reinforcement should be markedly reduced. 

A further response measure which might be used with this form of 
reinforcement is change in penis volume. Freund (1963) has reported on 
the use of a penile plethysmograph in the differentiation of homosexual 
from. heterosexual interests. According to Freund (1962), change in 
penis volume is a more sensitive index of arousal by pictures of nudes 
than is the GSR. Since this response should be relatively independent 
of the generalized arousal or orienting response, its comparison with 
the GSR offers interesting possibilities in the investigation of generalized 
versus specific arousal in the formation of conditional reflexes. 


3. Salivary Response 


Since the principles of conditional reflex activity were derived by 
Pavlov from studies of salivary conditioning in animals, there would 
appear to be obvious advantages to be gained from the adoption of this 
method in the study of human conditioning. However, the salivary con- 
ditioning of human Ss poses some difficult methodological problems. 

In early attempts to study saliva y conditioning in man (Finesinger 
and Sutherland, 1939; Lentz, 1935; Richter and Wada, 1924), salivary 
secretion was measured by means of the suction cup method originally 
devised by Krasnogorski (1931). In the modified version of this method 
published by Finesinger and Finesinger (1937), capsules, held by suction 
over Stenson's ducts, collect the secretion of the parotid glands. The saliva 
collected is transmitted by rubber tubes to a bottle filled with saline 
solution. The displaced saline passes through a tube with a capillary tip 
into a burette. Тһе passage of each drop of saline closes an electric 
circuit between two platinum wires in the wall of the burette, and sends 
a pulse to a kymograph pen. A measure of rate of salivary secretion is 
thus given by the number of pen movements per unit time. Lemon, or 
other fruit juice, which serves as the US, is conveyed to the patient's 
mouth via a third tube positioned by the suction cup. 
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In an experimental application of the technique, Finesinger et al. 
(1942) used a CS (metronome) presented for 3-10 sec in a delayed con- 
ditioning procedure, with a l-sec CS-US overlap. The magnitude of the 
CR was measured by presenting the CS for 25 sec on test trials. The 
amount of saliva secreted during this period was corrected by subtraction 
of the average quarter-minute rate of secretion during the minute pre- 
ceding the presentation of the test CS. 

Razran (1935) published an account of a new technique for measur- 
ing salivary secretion in conditioning experiments with human Ss. Sali- 
vation was measured by means of a standard dental roll inserted under 
55 tongue. The roll was weighed before insertion and after removal, 
the difference in weight being taken as a measure of secretion during the 
intervening period. Razran noted considerable variation in the magnitude 
of the salivary CR, and believed that a good deal of this variation could 
be attributed to the influence of central "attitude" or "set" factors. When 
the attention of S was directed elsewhere, so that there was minimal aware- 
ness of the stimulus relations of the experiment, the stability of the CR 
improved (Razran, 1936). 

The most recent studies of human saliv 
et al., 1955; Willet, 1960) have used the cotton roll method. Willet's care- 
fully controlled study showed that increased salivation occurred in the 
presence of the CS when no reinforcement was used. The experiment of 
Bindra et al. was replicated in the Adelaide laboratories with the addi- 
tion of a control group. The Ss in the control group were presented with 
the CS and US separated by random intervals rather than in conjunc- 
tion. The curves of both experimental and control groups (N — 20 in 
ally identical in form to those obtained by Bindra 
et al. On the first test trial, after 8 reinforcements, the amount of saliva- 
tion had reached a maximum which was sustained through the next 32 
CS-US pairings. The difference between the experimental and control 
Broups was not significant, but the response level of the control group 
was consistently higher than that of the experimental group. In а 
comparable experiment, with a test trial after each reinforcement, most 
of the increase in salivation to the CS occurred after the first reinforce- 
ment, and no further increase occurred beyond three reinforcements. 
This pattern conforms closely to that observed in the responses of "sen- 
sitization" control groups used in GSR conditioning. 

In a further unpublished study, John (1962) used a similar proce- 
dure with 80 experimental Ss and 80 control Ss. During the conditioning 
phase, salivation to the CS increased over 12 reinforced trials, but the 


rate of increase was significantly greater in the control group. A possible 
ed from Pavlov's (1927) observation 


ary conditioning (Bindra 


each group) were virtu. 


explanation of this result can be deriv 
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that if a CS of relatively long duration overlaps rather than precedes a 
relatively long US, the CS becomes inhibitory. John’s Ss were instructed 
to start sucking a lollipop (US) on a given signal, and the CS and US were 
presented simultaneously for 2 min. Under these circumstances it is 
likely that the development of inhibitory potential by the CS offset the 
development of excitatory potential by this stimulus, by way of either 
sensitization or CS-US pairing. 

The foregoing studies demonstrate quite unequivocally the occur- 
rence of nonassociative contamination in the response measures of salivary 
conditioning experiments such as that of Bindra, but they do not eluci- 
date the sources of this contamination. There are at least three possible 
explanations of the increased response to the CS during the conditioning 
phase observed in the control group Ss of the Adelaide and John experi- 
ments. The first two emphasize the possibility of measurement artefact, 
and the other suggests the occurrence of some form of pseudocondi- 
tioning. 

First, when the intertrial interval is short, there may be a carryover 
of unconditional salivation into the period of CS test presentation, The 
intertrial interval in the experiments in question, as in the Bindra study, 
was only about 1 min. In this connection, Finesinger et al. (1942), using 
à suction cup method, have observed that sometimes salivation persists 
for “а considerable time" after presentation of the US. Second, presenta- 
tion of the US (such as a lollipop) to a hungry person may sensitize the 
salivary reflex to the point where it will be elicited by any object. (such 
аз а cotton roll) placed in the mouth. Third, reinforcement may sensitize 
the salivary reflex to the previously indifferent CS, thus producing a form 
of nonassociative response increment similar to the sensitization that has 
been observed in aversive conditioning. 

The possible contribution of each of these factors to response con- 
tamination in cotton roll salivary conditioning could readily be sub- 
jected to experimental analysis. If it is found important, the factor of 
carryover of salivation could be controlled by longer intertrial intervals. 
Sensitization would need to be controlled by such measures as the use of 
long-delay conditioning. As it is a reasonable inference from accounts of 
Pavlov's work that sensitization could scarcely have been a problem in 
work with dogs, it might seem a priori that it should not be a problem 
with human beings. There is, however, other evidence to suggest that the 
salivary reflex may be significantly more sensitive in human beings than 
in animals. For example, Finesinger et al. (1942) found that not a few 
Ss secrete saliva continuously irrespective of the external situation. Some 
degree of "interval" or intertrial secretion was so common that it was 
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necessary to correct for its effect on response measures. As Finesinger et al. 
point out, Pavlov makes no mention of interval secretion, and it is a 
reasonable inference that it was not a problem. 

1f, as seems likely, it is found that measurement artefact results from 
the use of cotton rolls themselves, there will be no choice but to return 
to the direct measurement of salivary secretion. This would appear highly 
desirable in any case, since direct measurement has a number of im- 
portant advantages over the cotton roll technique. In the first place, con- 
tinuous monitoring of salivary flow enables selection of the intertrial 
interval necessary to avoid carryover of unconditional salivation. Second, 
it gives a precise measure of interval secretion, and thus makes possible 
correction for background level of salivation. Third, the method of direct 
presentation of the US, which is a natural corollary of direct recording, 
permits far more precise control over the timing and amount of reinforce- 
ment than can be achieved with the cotton roll method. Fourth, the 
whole procedure can be carried out without any movement whatever on 
the part of S. Once $ has adapted to the tubes and suction cups in his 
mouth, the course of conditioning is uninterrupted by any extrancous 
stimuli. In other words, we have a situation similar to that which Pavlov 
found essential for the salivary conditioning of dogs. By contrast, the 
cotton roll method , as it is commonly used, requires § to insert some 
solid reinforcing agent (lollipop or tablet) into his mouth, and to begin 
sucking on a given signal. In addition, 5 not infrequently has the US in 
view and is required to engage in some task. It seems not unlikely that 
these varied patterns of stimulus input will either facilitate or inhibit 
salivary flow, and account for at least some of the variability of response 
which is commonly noted. The results obtained with the suction cup 
method make it clear, nevertheless, that there is a good deal of inherent 
variability in the human salivary response. A direct experimental com- 
parison of the suction cup and cotton roll methods would be most 


instructive. 

It is of interest to consider possible modifications to the Finesinger 
and Finesinger suction cup method in the light of recent developments 
in instrumentation. Possibilities which suggest themselves include (a) sub- 
stitution of an inkwriting rate recorder for the kymograph; (b) presenta- 
tion of the liquid US by a small, fixed-delivery pump, automatically 
timed; (c) substitution of a plastic cannula for the suction cup. It is not 
immediately obvious what improvement could be made on the original 
method of transducing salivary flow into electrical impulses, but it is 
quite likely that a superior method exists or could be developed. It is 
clear that, modern developments notwithstanding, direct salivary record- 
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ing methods are technically demanding. Nevertheless, the available evi- 
dence suggests that serious application of salivary conditioning procedures 
in the study of personality can scarcely be undertaken without them. 


4. Alpha-Blocking Response 


The early work of Durup and Fessard (1935) and Loomis, Harvey, 
and Hobart (1936) suggested the possibility of conditioning the blocking 
or desynchronization of the alpha rhythm, for which a strong light is an 
adequate stimulus. 

In most experiments a tone has been used as the conditional stimulus. 
An extensive exploratory study was undertaken by Jasper and Shagass 
(1941) to (a) investigate “the depression (blocking) of the alpha rhythm 
in man as a conditional response, and (b) appraise the value of the 
electroencephalograph as an instrument for the study of conditioning 
problems, especially in man." Jasper and Shagass claimed to have 
demonstrated practically all of the Pavlovian conditioning phenomena, 
including temporal, trace, delayed, differential, and backward condi- 
tioning. In their view, the advtantages of EEG conditioning lie in the 
facility and rapidity with which conditioning of alpha blocking may be 
achieved, and its relative independence of voluntary influence, The main 
disadvantages are the instability of the CR and the fact that S must 
exhibit “ап almost continuous alpha rhythm which is free from ‘spon- 
taneous’ variations, and which responds clearly to visual stimuli” before 
unequivocal conditional responses can be obtained. 

Most investigators who have studied EEG conditioning in man have 
found the CR to be “poorly sustained” (Wells and Wolff, 1960a) or "un- 
stable" (Knott and Henry, 1941). Knott and Henry (1941) 
ful study, drew attention to the possibility of sensitiz 
to the tone in EEG conditioning. They noted that alpha blocking can be 
produced by a wide variety of stimuli. In fact, the tones ordinarily used 
as conditional stimuli are by no means "indifferent" stimuli; they are 
only relatively ineffective in comparison with strong light 
order to differentiate between sensitization of the previously 
sponse to the conditional stimulus 


, in a very ca 
ation of the response 


stimuli. In 
adapted re- 


, and conditioning proper, Knott and 
Henry used the method of delayed conditioning with a 4-sec CS-US 


interval. This interval permitted full recovery of the alpha rhythm after 
its response to the CS and before the US occurred. The results gave 
rather clear-cut evidence of both sensitization of the initial response to 
the CS and the development of a conditional anticip 
I-sec interval prior to US onset. 

The curve of the sensitized response to CS onset showed 
rise to approach its maximum during the first five trials 


atory response in the 


an abrupt 
Ы although the 
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peak did not occur until trials 21-30. From this point there was a slow 
decine in the response. The anticipatory response showed a negatively 
accelerated increase to reach its peak in trials 21—30, followed by a fairly 
rapid decline. 

Despite this early demonstration that nonassociative factors such as 
sensitization must be controlled in EEG conditioning, not all subsequent 
investigators have heeded the warning. Thus the Wells and Wolff (19602) 
study of EEG conditioning in brain-damaged patients employed a CS- 
US interval of only .8 to 1 sec, and did not include a control group. À 
temporary cerebral connection was considered to have occurred "if the 
alpha rhythm was obliterated or strikingly depressed after the presenta- 
tion of the tone and before the appearance of the light." The "condi- 
tioning” curve obtained showed a decline from an initial peak value 
similar to the usual adaptation curve. Stern, Das, Anderson, Biddy, and 
Surphlis (1961) questioned whether Wells and Wolff had demonstrated 
the development of a conditional cerebral response, and undertook a 
replication of the Wells and Wolff experiment. Their results showed a 
decline in the desynchronization response to tone over the 20 adaptation 
trials, followed by a rise to above the preadaption level in the first five 
trials of tone-light pairing, and a subsequent rather regular decline over 
the next 40 trials. Since the curve of the response to the tone almost 
exactly paralleled that of the response, to the light, Stern et al. interpret 
the rise in tone response, during the early stage of tone-light pairing, 
to a return of the adapted “orienting response” analogous to that ob- 
served in GSR conditioning. 

The implication of these results is that Wells and Wolff did not 
provide unequivocal evidence of EEG conditioning, and hence, did not 
provide evidence concerning the differential susceptibility to this type of 
conditioning of brain-damaged patients and normal controls. Since Wells 
and Wolff (1960b) used an identical procedure in their study of anxiety 
and EEG conditioning, it follows that this study too failed to produce 
evidence of differential rates of conditioning in anxious patients and 
normals. 

It should be obvious that Wells and Wolff have simply failed to 
disprove the null hypothesis of no difference between their clinical groups 
and normals. It is probable, however, that there is a positive correlation 
between disinhibition of adaptation and rate of conditioning [as Lovi- 
bond (1963a) has demonstrated for positive reward conditioning of the 
GSR] and that results similar to those of Wells and Wolff would be ob- 
tained with unequivocal measures of EEG conditioning. 

It is clear from the foregoing that conditioning of alpha desyn- 
chronization is susceptible to masking by nonassociative factors in much 
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the same way as the conditioning of other components of the orienting 
reflex and also the defensive reflex. When it is considered that only about 
60% of normals produce EEG records with alpha rhythm present 5076 
of the time (Davis, 1938; Henry and Knott, 1941), it is clear that condi- 
tioning of alpha blocking leaves much to be desired as a method of 
personality study. 

Nebylitsyn (1963) describes a method which he claims improves the 
stability of the conditional reflex connection in EEG conditioning. He 
notes that the reaction of depression or complete disappearance of the 
alpha rhythm always occurs upon the presentation of a light stimulus, 
and, under conditions of "novelty," when a stimulus of any modality 
appears. "In the former instance this reaction possesses virtually all the 
properties of an unconditional reflex, while in the second case it pos- 
sesses those of an orientation reflex" (Nebylitsyn, 1968, p. 23). In subse- 
quent discussion, however, Nebylitsyn points out that * 
nificant diminution" of the “unconditional” 
rhythm usually occurs with repeated presentation of the light stimulus. 
Consequently, he uses what he terms "activating reinforcement" in the 
elaboration of conditional blocking of the alpha rhythm. Activating re- 
inforcement refers to a reinforcement that “supports the orientation- 
investigatory reaction of the individual and thereby makes for stable 
retention of the conditional reflex.” A series of slide pictures displayed 


on a screen is employed for the purpose of activating reinforcement, and 
Nebylitsyn (1962) claims that more than 90% 


© of Ss with a reasonably 
clear-cut alpha rhythm develop stable CR's using this method. 


rapid and sig- 
depression of the alpha 


IV. Changing Personality Functioning by Conditioning Methods 


A. EXPERIMENTAL NEUROsES 


One of the earliest applications of conditioning procedures to prob- 
lems of personality was the study of experimental neuroses. 

This line of research began with some Observations made in Pavlov's 
laboratory while training a dog in a difficult differentiation (Pavlov, 
1927). After a CR of salivation to a circle was formed, a series of ellipses 
with different semiaxis ratios was presented as differential stimuli. Fol- 
lowing the usual practice, simple differentiation was first established 
by reinforcing the circle and omitting reinforcement to an ellipse with a 
ratio of 2:1 between the semiaxes. 

In following trials the shape of the ellipse was changed by steps until 
it approximated that of the circle. When an ellipse with a ratio between 
the semiaxes of 9:8 was presented the dog’s behavior suddenly 


P rell Es D 3 changed 
markedly. Its capacity for differentiation deteriorated, and 


its general 
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behavior changed from quiet cooperation to violent resistance. Pavlov 
used the term "experimental neurosis" to characterize the sudden dis- 
turbance in behavior which developed under the given experimental 
conditions. 

In another experiment, a salivary CR to a strong electric shock was 
established, but when an attempt was made to generalize the CR. by 
applying the shock to other parts of the skin, a breakdown of behavior 
occurred. The salivary CR disappeared and was replaced by a violent 
defensive reaction even in response to weak stimuli. Pavlov interpreted 
these observations as demonstrating that the clash of excitation with 
inhibition led to a profound disturbance of the usual balance between 
these two processes "and finally gave way to an undisputed predomi- 
nance of one of them, producing a pathological state." 

A comprehensive and detailed review of later work on experimental 
neuroses (Liddell, 1947, 1956; Gantt, 1942, 1944; Masserman, 1943; Wolpe, 
1952, 1958) has been published recently by Broadhurst (1960). Conse- 
quently the present treatment will be confined to a brief discussion of 
some of the issues of general importance which have arisen in relation to 
this work. 

Broadhurst very laudably sets out to establish criteria for determining 
whether or not experimentally induced behavioral disturbances in ani- 
mals may properly be termed neuroses. In this endeavor he is guided by 
Hebb (1947), who lists the following six criteria of human neuroses: The 
behavior must be (a) undesirable, or evaluatively abnormal, (b) emotional, 
(c) generalized, i.e., not solely a response to a specific excitant, (d) persis- 
tent following cessation of the specific excitant, (e) statistically abnormal, 
ie., occurring in a minority of the population, (f) without origin in a 
gross neural lesion. Broadhurst, with Hebb, rejects the following three 
common criteria: (i) neurosis has no known physiological base, (ii) neu- 
rked change of behavior from early base line, (iii) neu- 
“traumatic” experience like conflict or frustration. 
concludes that a great deal of the work 
1 experimental design, 


rosis produces a ma 
rosis follows from some 

Broadhurst very properly 
in the field has lacked rigor and sophistication ir 
and has been inadequately reported. Furthermore, many of the behavior 
disturbances produced fall far short of warranting the description experi- 
mental neurosis. However, Broadhurst meets certain difficulties in the 
application of Hebb's criteria. Because of lack of data concerning the 
behavior of experimentally neurotic dogs external to the experimental 
situation, he is forced to conclude that there is insufficient proof that the 
avior produced in Pavlov's laboratories can properly 


breakdowns of beh 
y arises, of course, from the fact 


be described as neuroses. This difficult 
that Pavlov himself emphasized criteria not included in Hebb's list, 
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particularly the criterion of disorders in conditional reflex activity other 
than that activity through which the disturbance was produced. An ex- 
ample was the failure of the dog in the original experiment to perform 
even the simplest differentiation which had previously been performed 
with ease. Other examples were the disappearance of all recently trained 
reflexes, either inhibitory or excitatory, and performing a negative re- 
sponse to a positive stimulus, and vice versa. For Pavlov these were the 
essential criteria, and it is of interest that nothing like them occurs in 
Hebb's list. The closest approach to this sort of criterion would be one 
of those rejected by Hebb, namely, "neurosis produces a marked change 
in behavior from an earlier baseline." Certainly, as it is formulated by 
Hebb, this criterion would fail to distinguish the behavior under con- 
sideration from a number of nonneurotic behavioral changes, but it 
would seem to do so if reformulated along the following lines: "inability 
to perform tasks which were performed with ease prior to breakdown, 
or which are performed with ease by individuals of similar behavioral 
capacity." This would seem to be a not unimportant criterion of human 
neurosis. If a patient with an hysterical paralysis is unable to lace his 
shoes, a practiced accountant is unable to add a few simple figures, or a 
dweller in a large city is unable to enter and operate an automatic ele- 
vator, we have examples of this sort of behavioral deficiency. 

Broadhurst rejects the various theoretical interpretations of experi- 
mental neuroses which have been made previously, and seeks to account 
for the phenomena in terms of conditional emotional responses. Both 
Hebb (1947) and Wolpe (1952) earlier offered a similar interpretation. 
Hebb was forced to make an exception in the case of breakdown occa- 
sioned by training in very difficult differentiation as originally reported 
by Pavlov. Wolpe, however, in support of his thesis that all experimental 
neuroses, and all human neuroses, can be regarded as maladaptive learned 
behavior patterns, attempts to bring the difficult differentiation case 
within his general framework. Wolpe classifies situations which precipi- 
tate experimental neuroses into those which use strong shock, those 
which use mild shock, and those which present ambiguous stimuli. Ex- 
periments of the first type produce conditional fear or anxiety, ie. 
stimuli associated with the strongly aversive shock come to elicit intense 
anxiety by virtue of this association. (It is of interest to note that the 
Pavlovians draw a sharp distinction between experimental neuroses in 
which there is evidence of generally disordered nervous activity as pre- 
viously discussed, and manifestations of "defense reflex" activity un- 
accompanied by signs of general disorder.) 

In order to explain how the repeated applic 


ation of mild shock can 
eventually give rise to a severe anxiety 


reaction, Wolpe postulates that 
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"when emotional conditioning first begins to be established, the condi- 
tioned stimulus evokes a small amount of anxiety. Let us call this x, and 
let it be half the amount evoked by the shock (2x). At this time, when 
the shock follows shortly on the conditioned stimulus, the anxiety from 
the two sources will add together, giving a total of 3x. As the result of 
conditioning, the next presentation of the conditioned stimulus will 
evoke, say, 14x, and the shock that follows will bring the total to 33x. It 
may then be found that the next presentation of the conditioned stimulus 
evokes 21x, that is, more than the shock itself. By repetition, the amount 
of anxiety evocable by the conditioned stimulus will gradually rise to a 
magnitude many times 2x. At some stage the anxiety will be great enough 
to produce obvious signs, and as experimenting continues these will be 
intensified." 

Wolpe's interpretation of the third precipitating situation, that of 
"ambiguous stimulation," is as follows. However slight and transient the 
anxiety produced by this conflict of tendencies may be at first, the drive 
reduction associated with its cessation results in the anxiety responses 
becoming reinforced to whatever stimuli are contiguous, and among these 
is the ambivalent stimulation itself. In the same way as suggested above 
for the case of mild noxious stimuli, the strength of the anxiety response 
is conceived to be gradually stepped up at each presentation of the am- 
bivalent stimulus situation while the drive-reduction potential corres- 
pondly grows. Eventually very powerful anxiety responses are evoked, 
avoidance behavior entirely replaces the approach responses 


and stron 
: hich were previously established by re- 


to the experimental situation W. 
peated feeding (Wolpe, 1952, p. 266). 

Broadhurst does not regard Wolpe's interpretation as wholly satis- 
factory, and he suggests an alternative way of bringing the behavior dis- 
turbance resulting from ambiguous stimulation into the general category 
of conditional emotional reactions. Broadhurst's hypothesis involves the 
a reformulation of the Yerkes-Dodson law. According to 


application of ECCO 
lationship between motivation and 


this law, there is a curvilinear re 
performance. Performance improves with increased motivation up to an 
optimum, beyond which further increase in motivation results in poorer 
performance. Second, the more difficult the task the lower the optimum 
level of motivation. In Hullian terms, “increasing drive level may in- 
crease the functional difficulty of a discrimination, so that objective in- 
creases in difficulty are in fact rendered more severe than the circum- 
stances otherwise warrant. Once the optimum motivation is reached, then 
the decrease in task performance may be viewed as an energization of 


incorrect responses which are lower in the habit family hierarchy than 


the correct ones” (Broadhurst, 1960, p. 754). 
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Broadhurst's application of these principles to the difficult differ- 
entiation situation is as follows. Initially S's fear reactions to the restraint 
and to the experimental situation generally are gradually inhibited and 
replaced by food-seeking responses. In training the discrimination, the 
difficulty of the task progressively increases. Optimum performance would 
thus require a progressive lowering of motivation, but as S is likely in 
the past to have encountered mainly simple situations requiring high 


drive for optimum performance, he is likely to have learned to respond 


to failure by increased motivation. Failure to receive reinforcement for 


wrong response to the negative stimulus thus results in increased drive, 


resulting in further failure and so on. Broadhurst suggests that at a certain 


stage “the increased drive level energizes incorrect responses in competi- 
tion with the correct ones—responses, especially fear ones, which were 
suppressed early on in the training. So it is that struggling, howling, and 
agitated behavior in general, which were previously characteristic of the 
subject's initial reactions to the sitaution are reinstated anew” (Broad- 
hurst, 1960, p. 755). 

Finally, in relation to the finding that after the behavior disturbance 
occurs $ is unable to make even simple differentiations, Broadhurst says, 
“It seems probable that this characteristic failure of a simple discrimina- 
tion can be accounted for if it is recalled that even relativ: 
have 


ely simple ta 
an optimum motivation, which may well be surpassed if anxiety 15 
added to the motivational complex.” 


The attempts of Wolpe and Broadhurst to account for the phenom- 
ena of difficult differentiation neuroses in terms of le 
reactions, although ingenious, are scarcely 
tion of an additive rel 


arned emotional 
convincing. Wolpe's assump- 
ationship between conditional and unconditional 
fear is somewhat implausible, especially in relation to breakdown in the 
difficult differentiation experiment. Equally implausible is Broadhurst's 
hypothesis that increased drive level energizes initial fear reactions to 
produce the "struggling, howling and agitated behavior" 
breakdown occurs. In the first place, it would appear from the descriptions 
available that animals in whom breakdown occurred did not necessarily 
show initial fear or resistance to the rest lining harness. Broadhurst's 
hypothesis that increasing drive results from continued failure, and in- 
creases the probability of further failure, is, in al] probability, correct, 
but it is unlikely that the increased drive is dependent on prior learning. 
It is far more likely that the increase is an unconditional effect of failure 
of stimulus input to match the “neuronal model” or “expected” input 
(Sokolov, 1960). Broadhurst's increasing-drive hypothesis thus gives a 
plausible account of why discrimination fails, but not of the resultant 
breakdown of behavior. One aspect of behavior disturbance in discrimina- 


observed when 
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tion experiments which creates difficulties for both theories is the relative 
suddenness with which the behavior disturbance typically occurs. The 
term “breakdown” appears to give an apt description of what happens. 
The theories of both Wolpe and Broadhurst, however, would lead one to 
expect a gradual development of behavioral disturbance. As it is also 
difficult to see how either theory could account for the many phenomena 
of disturbance of “phasic activity" which Pavlov (1927) describes, there 
would seem to be reason to question whether the phenomena of experi- 
mental neuroses in animals can adequately be described purely in terms 
of maladaptive responses. This question has important implications for 
the therapy of human neuroses, which will be taken up briefly in section 


IV, B. 


B. THERAPY or BEHAVIOR DISTURBANCES BY DIRECT CONDITIONING 
METHODS 

During the past decade there has been an increasing interest in the 
possibility of treating behavior disorders by direct conditioning methods. 
The procedures used are direct in the sense that they are aimed at the 
elimination of undesirable behavior patterns, or the development of 
desirable patterns, by the presentation of specific and precise stimulus 
the conventional therapeutic methods based 
e termed indirect be- 
“complex” which is 


relationships. By contrast, 
on Freudian theory, or one of its derivatives, may b 
cause attention is directed primarily toward the 
assumed to give rise to the symptoms. 

The method of direct conditioning in the treatment of behavior dis- 
orders has been termed “behavior therapy” by Eysenck (1960), but not 
all psychologists who use direct conditioning techniques find the implica- 
tions of this term acceptable. Eysenck (1960) has brought together vir- 
tually all of the papers dealing with behavior therapy which were pub- 
lished prior to 1960. The disorders which have been treated by this 
method with apparent success include phobias, obsessions, stuttering, 
fetishes, writer's cramp, alcoholism, hysterical paralyses, frequency of mic- 


turition, and enuresis. 
yf direct conditioning methods of treatment have centered 


Criticisms © у 
als only with symptoms, 


mainly around the charge that the treatment de 
and since the underlying conflict is left unresolved, symptom substitution 
anxiety is to be expected. No clear case of symptom sub- 
ave been recorded, however, and the general per- 
ing the relief of specific symptoms appear to 
(Eysenck, 1960; Lovibond, 


or exacerbated 
stitution appears to h 
sonality changes accompany 
be beneficial in the great majority of cases 
19641). 


Studies of direct conditioning treatment have usually dealt with 
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single cases, or at best very small numbers of 5з, and hence are open to 
the criticism that, as in the case of verbal psychotherapy (Eysenck, 1960), 
improvement over spontaneous recovery rates has not been demonstrated. 
In a few cases, predictions of the outcome of changes in therapeutic 
technique (e.g., Yates, 1958) have provided strong evidence of internal 
consistency, but only controlled group studies can establish the efficacy 
of direct conditioning treatment beyond any doubt. Eysenck (1960) has 
discussed the problems involved in providing unequivocal evidence of 
the effectiveness of any form of treatment of neurotic disorders. In the 
case of enuresis, the problems are far less formidable, and there is now 
sufficient evidence to warrant making direct conditioning therapy the 
treatment of choice for this disorder (Jones, 1960; Lovibond, 1964b). 
Initial arrest of wetting is achieved in 90% of cases, but as in direct 
conditioning treatment generally, 


the problem of preventing relapse or 
extinction of conditioning is 


à pressing one. Recent experimental work 
has been directed to the application of laboratory techniques, including 
partial reinforcement. schedules, to this problem (Lovibond, 1963c). By 
and large the demonstrational phase of direct conditioning treatment has 
passed, and it can be expected that the future will see more research into 
the precise mechanisms of particular behavioral disturbances, and a con- 
sequent increase in the precision of application of conditioning principles. 

The limits of applicability of direct conditioning methods is a moot 
question. We have seen that theorists such as Wolpe (1952, 1958) regard 
all neurotic disorders as learned malad 
hence susceptible to direct conditioning or behavior therapy. According 
to this approach, which has been discussed in detail by Eysenck (1960) 
and Yates (1958), a neurosis is symptoms and nothing else. It follows that 
treatment can only be symptomatic, i.e., aimed directly at the maladap- 
tive behavior patterns. This conception of neurotic disorders is usually 
contrasted with the Freudian or “dynamic” view that the neurosis is an 
unconscious conflict of which the symptoms are merely surface indicators. 

There is, however, a third view which has been associated in par- 
ticular with the work of Pavlov (1927) and Hebb (1947). According to 
this view, it is necessary to make a sharp distinction between behavior 
arising from a central state or event and that central state itself, in the 
present case between neurotic behavior and an hypothesized disturbance 
of nervous activity which constitutes the neurosis. This, of course, is a 
distinction which those who take the “maladaptive habit” view of neurosis 
do not recognize. 

If the distinction is accepted, a satisfactory explanation of neuroses 
must be sought in the neurophysiological mechanisms. of behavior rather 
than the behavior itself. The implication of this point of view is that 


aptive behavior patterns, and 
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neurosis may be manifest primarily in a set of maladaptive behavior 
patterns, or it may be manifest primarily in an inability to cope with pre- 
viously simple tasks and problems, and difficulty in acquiring new, adap- 
tive learning. To the extent that the latter features predominate, there 
will be an emphasis in therapy on procedures designed to affect the central 
state as such, including rest and sleep therapy, and the use of drugs, but 
conditioning procedures may be used in the development of new adaptive 
behavior patterns. To the extent that maladaptive behavior patterns pre- 
dominate, the therapeutic emphasis may well be placed on such methods 
аз reciprocal inhibition, desensitization, and aversive therapy. Those mal- 
adaptive behavior patterns which have been acquired under conditions 
of trauma or extremes of excitation are likely to have the character of 
S-R linkages, which, it is reasonable to hypothesize, will be outside the 
range of possible central integration and modification. It follows that the 
behavior patterns in question will be beyond the scope of verbal psycho- 
therapy, and direct conditioning procedures will be the only methods 
with a reasonable chance of success. It will be recognized, however, that 
neurosis will often involve disturbances of central processes as manifest 
in attitudes and beliefs, for the modification of which verbal psychothera- 
peutic procedures will be appropriate. The form of such psychotherapy 
will not necessarily bear a very close resemblance to conventional 


Freudian procedures. 
References 


Al-Issa, I. The effect of attitudinal factors on conditioning. In H. J. Eysenck (Ed.), 
Experiments in motivation. New York: Pergamon Press, 1963. 

Anand, B. K., and Brobeck, J. К. Hypothalamic control of food intake in rats and cats. 
Yale J. Biol. & Med., 1951, 24, 123-140. 

Anokhin, P. K. Electroencephalographic analysis of corticosubcortical relations. in 
positive and negative conditioned reactions. N.Y. Acad. Sci., 1961, 92, Art. 3, 


899-938. 
Barendregt, hE Research in psychodiagnostics. The Hague, Netherlands: Mouton, 


1961. 

Baron, M. R., and Connor, J. P. Eyelid conditioned responses with various levels of 
anxiety. J. exp. Psychol, 1960, 60, 310-313. 

Beam, J. C. Serial learning and conditioning under real-life stress. J. abnorm. soc. 
Psychol., 1955, 51, 543-551. 

Becker, W. C. Cortical inhibition and extraversion-introversion. J. abnorm. soc. 
Psychol., 1960, 61, 52-66. 

Becker, W. C, and Matteson, H. H. GSR conditioning, anxiety, and е 
J. abnorm. soc. Psychol., 1961, 62, 427-430. 

Bender, L., and Schilder, P. Unconditioned and conditioned reactions to pain in schizo- 
phrenia. Amer. J. Psychiat., 1930, 10, 365-384. 

Bindra, D., Paterson, A. L., and Strzelecki, J. On the relation between anxiety and 
conditioning. Canad. J. Psychol., 1955, 9, 1-9. 


aversion. 


164 S. Н. LOVIBOND 


Bitterman, M. E., and Holtzman, W. H. Conditioning and extinction of the Бау 
skin response as а function of anxiety. J. abnorm. soc. Psychol., 1952, 47, 615-623. 

Broadhurst, P. Abnormal animal behavior. In H. J. Eysenck (Ed), Handbook of 
abnormal psychology. London: Pitman, 1960, pp. 726-763. 

Cook, S. W., and Harris, R. E. The verbal conditior 
J. exp. Psychol., 1937, 21, 202-210. 

Das, J. P. An experimental study of the relation 
reactive inhibition. Thesis submitted for Ph.D 

Davis, H. The electroencephalogram. 

Davis, R. C, Buchwald, А. M. 
responses, and their rel 
69 (20). 

Diamond, S$., Balvin, R. S., and Diamond, F. 
behavioral approach to 

Durup, G., and Fessard 
1935, 36, 1-32. 

Eysenck, H. J. The dynamics o; 
Paul, 1957. 

Eysenck, H. J. Handbook of abnormal psychology. 1 

Eysenck, H. J- Conditioning and personality. 

Eysenck, H. J» and Claridge, 
dimensional framework of 
64, 46-55. 


Eysenck, S. B. С. An experimental stud 


ning of the Galvanic Skin Reflex. 


between hypnosis, conditioning and 
- Univer. of London, 1957. 

Tabulae Biologicae, 1938, 16, 116-131. К 
; and Frankmann, R. W. Ашопотіс and muscular 
ation to simple stimuli, Psychol. Monogr., 1955, No. 405, 


R. Inhibition and choice: A nuero- 
problems of plasticity in behavior. New York: Harper, une 
b А. L'éctrencéphelogramme de l'homme. L'annee Psychol, 


f anxiety and hysteria. London: Routledge & Kegan 


„ondon: Pitman, 1960. 

Brit. J. Psychol., 1962, 53, 299-305. 

G. The position of hysterics and dysthymics in а two 
personality description. J. abnorm. soc. Psychol., 1962, 


y of psychogalvanic reflex responses of normal 

neurotic, and psychotic subjects. J. Psychosom. Res., 1956, 1 -272. 

Field, I. G., and Brengelmann, J. C. Eyelid conditioning and three personality parame- 
ters. J. abnor. soc. Psychol., 1961, 63, 517-523. 

Finesinger, J. E., and Finesinger, G. L. A modification of the Krasnogorski method 


for stimulating and measuring the secretions from the parotid glands in human 
beings. J. lab. clin. Med. 1937, 23, 267-273. 


Finesinger, J. E., and Suther! nd, G. F. The salivary conditional reflex in man. Trans- 
am. neurol. assn., 1939, р. 50. 

Finesinger, J. E., Sutherland, G. E, and McGuire, F, F, The positive conditional 
salivary reflex in psychoneurotic patients. Amer. J. Psychiat., 1942 99, 61-74. 

Fitzwater, M. E., and Thrush, 


R. S. Acquisition of a conditioned response as а 
function of forward temporal contiguity. J. exp. Psychol., 1956, 51, 59-61. 


Franks, С. М. Conditioning and personality: A Study of normal and neurotic subjects. 
J. abnorm. soc. Psychol., 1956, 52, 143-150. 

Franks, C. M. P. 
48, 119-126. 

Franks, C. M. Conditioning and abnormal behavior. In H, J. Eysenck (Ed.), Handbook 
of abnormal psychology. London: Pitman, 1960. 

Freund, K. Personal communication, 1969. 

Freund, K. A laboratory method for diagnosing predominance 
sexual interest in the male. Behav. Res. Ther., 1963, 1, 85-93 

Gantt, W. H. The origin and developmeitt. of nervous disturbances experimentally 
produced. Amer. J. Psychiat., 1942, 98, 475-481. 

Gantt, W. H. Experimental basis for neurotic behavior: Origin and development of 


artificially produced disturbances of behavior in dogs. New York: Hoeber-Harper, 
1944. 


» 25 


ersonality factors and the rate of conditioning. Brit, J. Psychol., 


1957, 


of homo- or hetero- 


PERSONALITY AND CONDITIONING 165 


Gilberstadt, H., and Davenport, G. Some relationships between GSR conditioning and 
judgments of anxiety. J. abnorm. soc. Psychol., 1960, 60, 441-443. 

Grings, W. W. Preparatory set variables related to classical conditioning of autonomic 
responses. Psychol. Rev., 1960, 67, 243-252. 

Guk, E. D. The conditioned reflex activity in schizophrenics. Sovetsk. Nevropatol., 1934, 
рр. 76-84. 

Hebb, D. О. Spontaneous neurosis in chimpanzees. Theoretical relations with clinical 
and experimental phenomena. Psychosom. Med., 1947, 9, 3-16. 

Henry, W. E, and Knott, J. R. A note on the relationship between personality and 
the alpha rhythm of the electroencephalogram. J. exp. Psychol., 1941, 28, 362-366. 

Hilgard, E. R., and Humphreys, L. G. The effect of supporting and antagonistic 
voluntary instructions on conditioned discrimination. J. exp. Psychol. 1938, 22, 
291-304. 

Howe, E. S. GSR conditioning in anxiety states, normals, and chronic functional 
schizophrenic subjects. J. abnorm. soc. Psychol., 1958, 56, 183-189. 

Hull, C. L. Principles of behavior. New York: Appleton-Century-Crofts, 1943. 

Hull, C. L. A behavior system. New Haven, Connecticut: Yale Univ. Press, 1952. 

Ivanov-Smolens A. G. Uber die bedingten Reflexe in der depressiven Phase das 
manisch-depressiven Irreseins. Mschr. Psychiat. Neurol., 1925, 58, 376-388. 

Jasper, H., and Shagass, C. Conditioning the occipital alpha rhythm in man. J. exp. 
Psychol., 1941, 28, 373-388. 

Jenkins, J. J. and Lykken, D. Individual differences. Ann. Rev. Psychol, 1957, 8, 
79-112. 

Jensen, A. Personality. Ann. Rev. Psychol., 1958, 9, 295-322. 

John, I. D. Personal communication. Univer. of Melbourne, Australia, 1962. 

Jones, Н. G. The behavioral treatment of enuresis nocturna. In H. J. Eysenck (Ed.), 
Behavior therapy and the neuroses. New York: Pergamon Press, 1960. 

Kaada, В. R., and Johannsen, N. B. Generalized electrocortical activation by cortical 
stimulation in the cat. EEG clin. Neurophysiol., 1960, 12, 567-573. 

Kimble, G. A. Conditioning as a function of the time between conditioned and 
unconditioned stimuli. J. exp. Psychol., 1947, 37, 1-15. 

Kimble, G. A. Hilgard and Marquis’ Conditioning and learning. (2nd ed.) New York: 


Appleton-Century-Crofts, 1961. 
Kimble, G. A., Mann, L., and Dufort, R. H. Classical and instrumental eyelid condi- 


tioning. J. exp. Psychol., 1955, 49, 407-417. 

Kimmel, H. D., and Pennypacker, H. S. Differential GSR conditioning as a function 
of the CS-UCS interval. J. exp. Psychol., 1963, 65, 559-563. 

Knott, J. R., and Henry, W. E. The conditioning of the blocking of the alpha rhythm 
of the human electroencephalogram. J. exp. Psychol., 1941, 28, 134-144. 

Krasnogorski, N. I. Bedingte und unbedingte Reflexe im Kindersalter und ihre 
Bedeutung für die Klinik. Ergebn. d. inn. Med. u. Kinderh., 1931, 36, 613. 

Lacey, J. 1, Smith, R. L., and Green, A. Use of conditioned autonomic responses in 
the study of anxicty. Psychosom. Med., 1955, 17, 208-217. 

Landkof, B. L. Unconditioned and conditioned vascular reflexes in schizophrenics. 
Trud. Tsentral. Psikhonevrol. Inst., 1938, 10, 37-63. 

Lentz, A. K. Les reflexes conditionnels salivaires chez l'homme sain ct aliéné et leur 
rapprochement avec les donnes de la conscience. L'encephale, 1935, 11, 394. 

Liddell, H. S. The experimental neurosis. 4nn. Rev. Physiol. 1947, 9, 569-580. 

Liddell, H. S. Emotional hazards in animals and man. Springfield, Illinois: Charles C 


Thomas, 1956. 


166 S. Н. LOVIBOND 


Loomis, A. L., Harvey, E. N., and Hobart, С. A. Brain potentials during hypnosis. 
Science, 1936, 83, 239-241. 


Lovibond, S. H. The Object Sorting Test and conceptual thinking in schizophrenia. 
Austral. J. Psychol., 1954, 6, 52-70. 


Lovibond, S. H. Conceptual thinking, personality, and conditioning. Brit. J. soc. clin. 
Psychol., 1963a, 2, 100-111. 


Lovibond, S. H. Positive and negative conditioning of the GSR. Аса Psychologica, 
1963b, 21, 100-107. 


Lovibond, S. H. Intermittent reinforcement in behavior therapy. Behav. Res. Ther., 
1963c, 1, 127-132. 

Lovibond, S. H. Manual for the Object Sorting Test. U 
of Adelaide, Australia, 1964. (a) 

Lovibond, S. Н. Conditioning and enuresis. 


npublished manuscript, Univer. 


New York: Pergamon Press, 1964. (b) 
McAllister, W. R. Eyelid conditioning as a function of the CS-US interval, J. exp. 
Psychol., 1953, 45, 417-422. 


Malmo, R. B. Measurement of drive: an unsolved 


problem in psychology. In М. К. 
Jones (Ed.), Nebraska Symposium on Motivation. Lincoln: Univer, of Nebraska 
Press, 1958. 


Martin, I. GSR conditioning and pseudoconditioning. Brit. J. Psychol., 1962, 58, 365-371. 

Martin, 1, Delayed GSR conditioning and the effect of electrode placement on measures 
of skin resistance. J. Psychosom. Res., 1963, 7, 15-22, 

Masserman, J. H. Behavior and neurosis: an experimental 
psychobiologic principles. Chicago, Illinois: Univer. 

Mays, L. L. Studies of catatonia. V. Persever: 


psychoanalytic approach to 
of Chicago Press, 1943. 


ational tendencies in catatonic patients. 
Psychiat. Quart., 1934, 8, 728-735. 
Meehl, P. E. Schizotaxia, schizotypy, and schizophrenia. Amer, Psychologist, 1962, 17, 
827-838. i 
1 f * CS-UCS interval in. è куле 
шш The CS-UCS interval in GSR conditioning. J. exp. Psychol, 1954, 48, 


Moruzzi, G., and Magoun, H. W. Brainstem 
EEG. EEG clin. Neurophysiol. 1949, 1, 4: 73. 

Nebylitsyn, V. D. Die Anwendung der Faktorenanalyse bei der Erforschung der 
Struktur der höheren Nerventiitigkeit. Probleme und Жене, di ES М s 
1961, 3-4, 119-126. = А er Psychologie, 

Nebylitsyn, V. D. Personal communication, 1969. 

Nebylitsyn, V. D. Certain electroencephalographic indi 
processes. Soviet Psychol. & Psychiat., 1963, 1, 2 

Nebylitsyn, V. D., Rozhdestvenskaya, V. L, and Teplov, B, М. Concerni he inter- 
relation between absolute sensitivity and Strength of the nervou psi: the i 1 
exp. Psychol. 1960, 12, 17-25. s System. Quart. J- 

O'Connor, N. 1959. Ref. in Willet, R. A. Measures of | 
H. J. Eysenck (Ed.), Experiments in personality, 1 


ticular formation and activation of the 


cators of equilibrium in nervous 
— 
--. 


caring and conditioning. In 


-ondon: " x a 
1960, Vol. 11. adon Routledge & Kegan Paul, 
Olds, J. A preliminary mapping of electrical reinforcing emt... ; | 
J. comp. physiol. Psychol., 1956, 49, 281-285. В effects in the rat brain. 
Olds, J. Effects of hunger and male sex hormone On self-stj 
-st ati - 
J. comp. physiol. Psychol, 1958, 51, 320-394. ‘mulation of the brain. 
Olson, W. С. Problem tendencies in children. 


Minneapolis: U 


1930. niver. of Minnesota Press. 


PERSONALITY AND CONDITIONING 167 


Pavlov, I. P. Conditioned reflexes. (Transl. by C. V. Anrep.) London and New York: 
Oxford Univer. Press, 1927. 

Peters, Н. N., & Murphree, О. D. The conditioned reflex activity in the chronic 
schizophrenic, J. clin. Psychol., 1954, 10, 126-130. 

Pfaffman, C., and Schlosberg, Н. The conditioned knee-jerk in psychotic and normal 
individuals. J. Psychol., 1936, 1, 201-205. 

Prokasy, W. F., Hall, J. F., and Fawcett, J. T. Adaptation, sensitization, forward and 
backward conditioning, and pseudo-conditioning of the GSR. Psychol. Rep., 1962, 
10, 103-106. 

Razran, С. Н. $. Conditioned responses: an experimental study and a theoretical 
analysis. Arch. Psychol., 1935, No. 191. 

Razran, G. H. S. Attitudinal control of human conditioning. J. Psychol., 1936, 2, 
327-397. 

Richter, C. P., and Wada, T. Method of measuring salivary secretion in human beings. 
J. lab. & clin. Med., 1924, 9, 271. 

Rodnick, E. H. Characteristics of delayed and trace conditioned responses. J. exp. 
Psychol., 1937, 20, 409-425. 

Rozhdestvenskaia, V. L, Nebylitsyn, V. D., Borisova, M. N., and Ermolaeva-Tomina, 
L. B. A comparative study of various indicators of the strength of the human 
nervous system. Soviet Psychol. & Psychiat., 1963, 1, 10-22. 

Schiff, E., Dougan, C., and Welch, L. The conditioned PGR and the EEG as indicators 
of anxiety. J. abnorm. soc. Psychol., 1949, 44, 549-552. 

Shipley, W. C. Studies of catatonia: VI. Further investigation of the perseverational 
tendency. Psychiat. Quart. 1934, 8, 736-744. 

Silverman, R. E. Eliminating a conditioned GSR by reduction of experimental anxiety. 
J. exp. Psychol, 1960, 59, 122-125. 

Sokolov, E. N. Neuronal models and the orienting reflex. In Mary A. B. Brazier (Ed.), 
The central nervous system and behavior. Madison, New Jersey: Madison Printing 
Co., 1960. 

Solyom, L., and Beach, L. Further studies upon the effects of the administration of 
ribonucleic acid in aged patients suffering from memory (retention) failure. 
Neuropsychopharmacol., 1961, 2, 351-355. 

Spence, K. W. The relation of anxiety to differential eyelid conditioning. J. exp. 
Psychol., 1954, 47, 127-134. 

Spence, К. W., and Farber, I. E. Conditioning and extinction as a function of anxiety. 
J. exp. Psychol., 1953, 45, 116-119. 

Spence, К. W., and Коз, І. E. А methodological study of the form and latency of 
eyelid responses in conditioning. J. exp. Psychol., 1959, 58, 376-385. 

Spence, К. W., and Taylor, J. A Anxiety and strength of the UCS as determiners of 
the amount of eyelid conditioning. J. exp. Psychol., 1951, 42, 183-188. 

Spence, K. W., and Taylor, J. A. The relation of conditioned response strength to 
anxiety in normal, neurotic, and psychotic subjects. J. exp. Psychol., 1953, 45, 
265-272. 

Spence, К. W., Farber, I. E. and Taylor, E. The relation of electric shock and anxiety 
to level of performance in eyclid conditioning. J. exp. Psychol., 1954, 48, 404-408. 

Spooner, A., and Kellogg, W. N. The backward-conditioning curve. Amer, J. Psychol, 
1947, 60, 321-334. 

Stern, J. А., Das, К. C. Anderson, J. M., Biddy, К. L., and Surphlis, W. Conditioned 
alpha desynchronization. Science, 1961, 134, 388-389. 


168 5- Н. LOVIBOND 


Stewart, M. A. Winokur, G., Stern, J. A., Guze, S. B., Pfeiffer, E., and Hornung; F. 
Adaptation and conditioning of the galvanic skin response in psychiatric patients. 
J. ment. Sci., 1959, 105, 1102-1111. Р 

Stewart, M. А., Stern, J. A., Winokur, G., and Fredman, S. An analysis of GSR condi- 
tioning. Psychol. Rev., 1961, 68, 60-67. А 

Sweetbaum, Н. А. Comparison of the effects of introversion-extraversion and anxiety 
on conditioning. J. abnorm. soc. Psychol., 1963, 66, 249-954. 

Switzer, S. A. Anticipatory and inhibitory characteristics of delayed conditioned 
reactions. J. exp. Psychol., 1934, 17, 603-620. 

Taylor, J. A. The relationship of anxiety to the conditioned eyelid response. J. exp. 

Psychol., 1951, 41, 81-92, 

Taylor, J. A. Drive theory and manifest anxiety. Psychol, Bull., 1956, 53, 303-320. 

Taylor, J. A., and Spence, К. W. Conditioning level in the behavior disorders. J. ab- 
norm. soc. Psychol., 1954, 49, 497-502. 

Teplov, В. М. Typological Properties of the n 
manifestations. In N, O'Connor (Ed.), 
Bamon Press, 1961, y 

Vogel, M. D. The relation of personality factors to GSR conditioning of alcoholics. 
An exploratory study. Canad. J. Psychol., 1960, 14, 275-280. 

Vogel, M. D. GSR conditioning and personality factors in alcoholics and normals. 
J. abnorm. soc. Psychol., 1962, 63, 417-421. 

Voronin, L. G., and Sokolov, 


crvous system and their psychological 
5 ў /ork: Per- 
Recent Soviet psychology, New York: Рет 


E. N. Cortical mechanisms of the orienting reflex and 

its relation to the conditioned reflex. Та Н. H. Jasper & G. D. Smirnov (Eds) 
Moscow Colloquium on Electroencephalography of Higher Nervous Activity. 
EEG clin. Neurophysiol., 1960, Suppl. No. 13. 

Welch, L., and Kubis, J. Conditioned PGR (psychogalvanic response) in states of 
pathological anxiety. J. nerv. ment, Dis., 1947, 105, 372.381. 

Wells, C. E., and Wolff, H. G. Formation of temporary cerebral connections in normal 
and brain-damaged subjects. Neurology, 19602, 10, 335-340. 

Wells, C. E, and Wolff, Н. С. Electrographic evidence of impaired brain-function in 
chronically anxious patients. Science, 1960b, 131, 1671-1679. 

White, C. T., and Schlosberg, H. Degree of conditioning of the GSR as a function of 
the period of delay. J. exp. Psychol., 1952, 43, 357-362. 

Wickens, D. D., and Wickens, C. D. Some 
Psychol., 1942, 31, 518-526. 

Wickens, D. D., Allen, С. K., and Hill, F. A. Eft 


*ct of instructions and UCS strength 
on extinction of the conditioned GSR, J. exp. Psychol., 1963, 66, 235-240. 


Willett, R. А, Measures of learning and conditioning. In Н. J. Eysenck (Ed.), Experi- 
ments in personality. London: Routledge & Kegan Paul, 1960, Vol. II. 

Willet, R. A. Eyeblink conditioning and situation induced anxiety drive, In H. J. 
Eysenck (Ed), Experiments in motivation, New York: Pergamon Press, 1963. 


Wolpe, J. Experimental neuroses as learned behaviour, Brit. J. Psychol., 1952, 43, 243- 
268. 


factors related to pscudoconditioning. J. exp. 


Wolpe, J. Psychotherapy by reciprocal inhibition, Stanford, 
ver. Press, 1958. 

Wyrwicka, W., and Dobrzecka, C. Relationship between 
of the hypothalamus. Science, 1960, 132, 805-806. 
Yates, A. J. The application of learning theory to the tre: 

Psychol., 1958, 56, 175-182. 


California: Stanford Uni- 
feeding and satiation centers 


atment of tics, J. abnorm. soc. 


REPRESSION-SENSITIZATION AS A 
DIMENSION OF PERSONALITY 


Donn Byrne 


THE UNIVERSITY OF TEXAS, AUSTIN, TEXAS 


I. Approach versus Avoidance Responses to Threatening 


Stimuli 170 
A. Perceptual Defense g ak 170 
B. Measurement of Repression-Sensitization 173 
IL A Review of Research on the R-S Scale . 177 
A. Repression 177 
B. Measures of Self-Description 186 
C. Maladjustment ++ жя 190 
D Iditional Antecedents, Correlates, and Consequents .. 197 


HI. Repression-Sensitization: What Are the Next Research 


Steps? ...- 208 
A. What Is Being Measured by the R-S Scale? 208 
B. Considerations in Building a Theory of Repression 
Sensitization 211 
References... 215 


Psychologists in the field of personality have evidenced a consistent 
and long-continued interest in unconsciously motivated behavior. It may 
be observed that some individuals fail to verbalize feelings of anxiety 
when confronted by stimuli judged by outside observers to be anxiety 
evoking. Further, these same individuals are seen subsequently to engage 
in activities which may be hypothesized to control or reduce anxiety. 
Observations and speculations of this variety have stimulated a consider- 
able amount of research, especially during the past two decades. 

Anxiety-reducing activities which are unconsciously motivated are 
given the label of defense mechanisms. Various descriptions of these 
mechanisms, originating almost exclusively in psychoanalytic theory, 
have been proposed over the years. A unidimensional categorization 
which encompasses many diverse mechanisms grew out of the research of 
the new look in perception in the 1940's: repression-sensitization. At one 
end of this continuum of defensive behaviors are those responses which 
involve avoidance of the anxiety-arousing stimulus and its consequents. 
Included here are repression, denial, and many types of rationalization. 
At the sensitizing extreme of the continuum are behaviors which involve 
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an attempt to reduce anxiety by approaching or controlling the stimulus 
and its consequents. The latter mechanisms include intellectualization, 
obsessive behaviors, and ruminative worrying. 

Beginning with his 1950 doctoral dissertation at Stanford, Charles 
Eriksen was one of the first investigators to make and to pursue this dis- 
tinction of types of defense mechanisms while Jesse Gordon (1957) intro- 
duced the terms “repressor” and “sensitizer.” The specific research which 
formed the background for much of the current work on the repression- 
sensitization dimension will be reviewed briefly. 


1. Approach versus Avoidance Responses to 
Threatening Stimuli 


A. PERCEPTUAL DEF 


SE 
1. Initial Investigations 


An early and extremely influential article in the postwar perceptual 
studies was co-authored by Jerome Bruner and Leo Postman (19472). The 
basic assumption underlying most of the 
at the outset of that article (p. 300): 

“Perception is a form of adaptive behavior. 
not only the characteristics of 


new-look approach was stated 


Its operation reflects 
sensorineural processes, but also the 
dominant needs, attitudes, and values of the organism. For perception 
involves a selection by the organism of a relatively small fraction of the 
multiplicity of potential stimuli to which it is exposed at any moment 
in time. In perception, moreover, certain stimuli are accentuated and 
vivified at the expense of others. Finally, what is “habitually seen” in any 
given perceptual situation is a function of the fixation of past perceptual 
responses in similar situations. Through these three processes—selection, 


accentuation, and fixation—the adaptive needs of the organism find ex- 
pression in perception.” 


Individual differences in perceptual 
came evident in investigations emerging 
Relations at Harvard. With 19 undergraduates as subjects, Bruner and 
Postman (1947b) obtained associative reaction times for each of 99 words 
including a large proportion of potentially threatening ones such as 
“raped,” “death,” “penis,” “agony,” and “blush.” Two weeks later, each 
subject was presented with 18 stimulus words (those yielding the indi- 
vidual’s six fastest, midmost, and slowest reaction times) on a tachisto- 
scope. Each word was presented at increasingly slower exposure speeds 
until correct recognition was obtained. A significant relationship between 
reaction time and recognition exposure speeds was found. Of primary 


adaptation to threat soon be- 
from the Laboratory of Social 
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interest here was the finding of two different patterns of response among 
the subjects. In attempting to achieve an explanation of the two patterns, 
the authors suggested that some individuals revealed a defense process 
in which recognition threshold was a monotonic increasing function of 
associative reaction time. The greater the anxiety, the greater the “регсер- 
tual defense." The responses of other subjects suggested a sensitizing 
process in which recognition time was actually faster for the most anxiety- 
provoking words than for the six midmost words. Rather than avoiding 
the perception of threat, the latter individuals were characterized as 
vigilant in perceiving it. Perceptual vigilance had been found previously 
in response to goalrelevant stimuli congruent with need reduction 
(Bruner and Goodman, 1947) and value systems (Postman, Bruner, and 
McGinnies, 1948), but the notion of sensitization to threat created a new 
wave of research interest. 


2. Individual Differences in Perceptual Defense 


The bulk of the perceptual defense literature which followed the 
Harvard studies dealt with demonstrations, explanations, and criticisms 
of findings reporting threshold differences for threat versus nonthreat 
stimuli. In addition, however, individual differences in characteristic 
defensive reactions along a repressing-sensitizing dimension were also 
under investigation. In general, research was conducted in which subjects 
were categorized as to defensive mode by one means or another and then 
tested for perceptual threshold differences for threat versus nonthreat 
stimuli (Carpenter, Wiener, and Carpenter, 19 6; Eriksen, 1951, 1952a; 
Kissin, Gottesfeld, and Dickes, 1957; Kogan, 1956; Kurland, 1954; Lazarus, 
Eriksen, and Fonda, 1951; Nelson, 1955; Perloe, 1960; Shannon, 1962). 

The subjects in these studies consisted of the usual groups of college 
hospitalized neuropsychiatric patients, and ambulatory 
clinic patients. Differential recognition thresholds or differential accuracy 
scores were obtained for a variety of neutral and conflictful (primarily 
sex апа hostility) stimuli which included words, drawings, Blacky pic- 
tures, and sentences recorded on tape. Recognition was made difficult by 
variations in speed of exposure, intensity of illumination, legibility of 
successive carbon copies, loudness of recordings, and intelligibility of 
auditory presentations partially masked by white noise. Independent in- 
dicators of defensive mode included readiness to verbalize conflictful 
material on sentence completion tests and in TAT stories, ability to re- 
call failure versus success words in an experiment using scrambled sen- 
tences, ability to recall Blacky pictures, scores on the California F Scale, 
scores on the Defense Preference Inquiry for the Blacky pictures, clinical 


undergraduates, 
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ratings of Rorschach and Machover Figure-Drawing protocols, and classi- 
fication as to characteristic defensive mode on the basis of interviews and 
case history material. 


Although these studies utilized a variety of subject populations, 
perceptual tasks, and measures of defenses, significant relationships were 
consistently reported between perceptual behavior and defenses with few 
exceptions (e.g., Kurland, 1954). Those individuals who have difficulty 
in perceiving threatening material accurately also give evidence of block- 
ing, repression, and avoiding when responding to conflictful stimuli in 
other contexts, Conversely, those who perceive threatening stimuli as 
accurately as or more accurately Шап neutral stimuli respond in other 
Situations with intellectualization, sensitization, and general approach 
behavior. 

It should be noted that the explanation for these findings is far from 
Clear. As with other perceptual defense data, some investigators prefer to 
explain the findings in terms of variables other than defense mechanisms 
and unconscious motives, For example, after reviewing the literature on 
individual differences, Brown (1961, p. 323) suggests: 

“The individual-difference method does not, however, preclude the 
possibility that the reactions to all tests are due to differential experiences 
with the materials of the tests rather than to percepual defense. When 
individuals who are “expressive” with respect to sexual matters are selected 
out of a group, we may simply be selecting those who, in addition to 
being expressive, or perhaps because of their expressiveness, have had 
more extensive experiences with sexual words, symbols, and ideas. On 
the other hand, the test-defined ‘sexual inhibitor’ might well have had 
fewer exposures to, and experiences with materials of a sexual nature. 
If such groups differ in their ability to identify sexual words, it might 
be more sensible to explain these results by referring to individual dif- 
ferences in associative strength rather than to traits of defensiveness or to 
value systems with vague motivational overtones.” 

Given such an explanation, the mysterious qu 
defense vanishes in the clear air of the known relationship between word 
familiarity and perceptual accuracy (e.g, Solomon and Postman, 1952). 
"Thus, theoretical speculations about the perceptual data have tended to 
move away from some of the more colorful earlier notions such as un- 
conscious perception. We are nevertheless, as Lazarus et al, (1951) point 
out, left with individual differences in expressiveness, experiences with 
conflictful material, defenses, or whatever label one chooses to apply. For 
personality psychologists, the questions still remain as to th 
correlates, and consequents of these individual differences, 


ality of perceptual 


€ antecedents, 
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3. Repression-Sensitization in Other Response Measures 


At approximately the same period in time as the differential thresh- 
old studies, many other investigations utilizing repressing-sensitizing vari- 
ables were conducted. It was found, for example, that subjects who tend 
to recall their failures in an experimental task tend to recall material 
associated with a painful shock, while those who forget one forget the 
other (Lazarus and Longo, 1953). Individuals who recall incompleted 
tasks in a threatening situation learn affective words as easily as neutral 
ones, while those who forget the incompleted tasks experience relative 
difficulty in learning affective words (Eriksen, 1952b). Those subjects 
who are able to verbalize a pattern of electric shock applied during a 
learning task are able deliberately to avoid giving the punished responses 
(Eriksen and Kuethe, 1956). Relatively shorter latency for aggressive and 
succorant words on a word-association test is associated with the acceptance 
of such concepts on the Rorschach, while relatively longer latency is 
associated with failure to accept the concepts on the Rorschach (Eriksen 
and Lazarus, 1952). Giving emotional words in response to appropriate 
TAT cards is positively related to the number of Rorschach responses 
given (Ullman, 1958). Patients identified as facilitators according to their 
case histories prefer sexual and aggressive humor while patients identified 
as inhibitors prefer nonsense humor (Ullman and Lim, 1962). 

An examination of the perceptual studies and the subsequent work 
suggests rather strongly the presence of an approach-avoidance sort of 
dimension with respect to response to threatening stimuli. It should also 
be noted that these behavior tendencies appear to be fairly pervasive ones 
in that they are identifiable in perceptual responses, responses given to 
projective tests, behavior in learning and memory tasks, and in symptoms 
uch relational fertility is a convincing argument for 


of maladjustment. 5 
the value of pursuing this variable in further research. 


B. MEASUREMENT OF REPRESSION-SENSITIZATION 


A familiar problem which arises, though, is that of agreeing upon a 
suitable defining operation for repression-sensitization. In the work out- 
lined above, almost every investigator utilized a different measure of the 
defense dimension. And, while significant correlations were generally 
reported between the various approach-avoidance indices, the magnitude 
of the relationships is hardly sufficient to conceptualize them as inter- 
changeable measures of a single construct. An additional difficulty with 
the measures used in the various experiments is the problem of low or 
even unknown reliability (Byrne and Holcomb, 1962). As a first step in 


establishing the presence of a relationship between variables, relatively 
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low reliability of measurement only acts to increase the probability of 
making Type II errors. It is essential, however, in order for research in 
an area to progress beyond these first crude findings that Y is related to 
X, to obtain reliable measures of the variables involved and to utilize 
operations consistently across experiments. 

If the preceding assumption is accepted, what would be the next 
logical step in measurement? One possibility would be to investigate the 
reliabilities of each reported defining operation (differential thresholds, 
memory scores, TAT responses, etc.) and select the most reliable as the 
measure of repression-sensitization in all future work. If satisfactory reli- 
abilities were not found, the best measure could be used as a starting 
point for further test construction. The decision is a somewhat arbitrary 
one, but the goal in any event would be a consistent measuring instru- 
ment which yields scores that correlate with behaviors already defined as 
repressing-sensitizing. Consistency here means that the device has homo- 
geneous response variations across items, stability of scores over time, and 
an objective scoring system on which any two independent judges can 
obtain close agreement. А measurement approach meeting those criteria 
is one based on a combination of re 
have not yet been discussed. 


pression-sensitization indices which 


1. Use of the MMPI to Measure Defensive Behavior 


While the Minnesota Multiphasic Personality Inventory (MMPI) 
(Hathaway and McKinley, 1951) was originally devised as a diagnostic 
aid, it has subsequently been used in both research and clinical work for 
a wide variety of additional Purposes. Since the individuals classified 
into the various psychiatric syndromes identified by the test tend to 
differ in characteristic defensive modes and since some of the validity 
scales have been considered as reflecting defenses, it was perhaps inevit- 
able that MMPI scales would be utilized in work on r 


epression-sensitiza- 
tion. 


A summary of the scales and combinations of Scales used by various 
investigators is shown in Table I. There was relatively good agreement 
among investigators as to the type of 


i MMPI items most indicative of 
repressing and sensitizing defenses. 


In the work cited in Table I, the MMPI measures of repression- 
sensitization were found to be related to such variables as differential 
recall of completed and incompleted tasks, frequency of dream recall, 
defense mechanisms in hospitalized neurotics, differentia] perceptual 
recognition thresholds, and interpersonal perception. With these findings 
as a promising start, two independent efforts were made at approximately 
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the same time to devise a single, comprehensive MMPI scale to measure 


the repression-sensitization dimension. 


TABLE I 
MMPI ScaLEs Usep As DEFENSE MEASURES 
Represser Sensitizer 

Scales score score Reference 

K High Low Page and Markowitz (1955) 

1. High Low Page and Markowitz (1955); 
Tart (1962) 

F minus K Low High Ullmann (1958) 

Hy High Low Eriksen (1954); Mathews and 
Wertheimer (1958) 

Hy denial High Low Carlson (1954); Gordon (1957); 
Gordon (1959) 

Hy admission Low High Sordon (1959) 

Hy minus Pt High Low Eriksen and Davids (195 
Truax (1957) 

Pt Low High Carlson (1954; Eriksen 
(1954); Eriksen and Browne 
(1956); Eriksen and Davids 
(1955); Eriksen, Keuthe 
and Sullivan (1958) 

MAS Low High Eriksen and Davids (1955); 
Gordon (1959) 

Welsh A Low High Tart (1962) 

Welsh R High Low Tart (1962) 


2. A Repression-Sensitization Scale 


At the Behavioral Research Laboratory of the Veterans Administra- 
tion Hospital at Palo Alto, a number of investigations were undertaken 
to follow up and extend Shannon's (1955, 1962) work with facilitators 
and inhibitors. Ullmann (1958) devised a scoring system for case history 
material which was used in several projects. As a primary measure, how- 
ever, the case history scoring approach was limited by problems of inter- 
judge reliability, the use of discrete categories of individuals rather than 
a continuum, and its inapplicability to nonpatient populations. To over- 
come these difficulties, Ullmann (1962) undertook the construction of an 
empirically derived MMPI scale to measure facilitation-inhibition (or 
repression-sensitization). Using an item-analysis approach, he compared 
the responses of 38 facilitators and 24 inhibitors (identified by the case 
history method) on each of the 566 MMPI items. A cross-validation sample 
consisted of 48 facilitators and 22 inhibitors. A total of 21 items met the cri- 
terion of differentiation at the .05 level in both samples. An additional 23 
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items differentiated at the -10 level and were related to the total facilitation- 
inhibition score in a third sample of 61 subjects. The final scale consisted 
of 44 items. On a new sample of subjects, 
of .96 and test-retest reliabilities of .88 (1 to 18 months), .71 (19 to 36 
months), and .54 (37 to 85 months) were obtained. Studies utilizing Ull- 
mann's scale will be reported in a following 

At about the same time that Ullmann’s work was under way, Altroc- 
chi, Parsons, and Dickoff (1960) at Duke University approached the 
measurement of repression-sensitization via the MMPI in a somewhat 
different Way. On the basis of the work of Eriksen and Gordon, three 


MMPI scales were selected as likely measures of sensitization (D, Pt, and 


` я үз { 
Welsh Anxiety) and three others as measures of repression (L, K, and Hy 


denial). A repression-sensitization index was determined by subtracting 
each subject's total Score on the latter three scales from his total score on 
the three former ones. In neither this paper nor a subsequent report 
(Altrocchi, 1961) were reliability data presented. ө 
In work at the University of Texas, the author (Byrne, 1961, p. 337) 
attempted to refine Altrocchi's scale because "Several potential measure- 
ment difficulties arise With this measure because of item overlap among 
the six MMPI scales which are combined in it. For example, item 32 
contributes to D, Pt, and Welsh Anxiety scores, thereby giving it an ar- 
bitrary weight of three. More confusing possibilities also arise. For er 
i to D and to L, K, and Hy denial; thus, it s 
€s of the index with a net weight of two for 
erential item weights could conceivably prove 
ature in this instance is clear. 


the author substituted a scoring 
System in which each item on the six scales Was scored only once and all 


inconsistently Scored items eliminated. The result was a 156-item Re- 
pression-Sensitization (R-S) Scale i 


ing responses. Both the corrected ici 
the coefficient of stability (six weeks) we 
work (Byrne, Barry, and Nelson, 1963), an 


section. 


pendent Samples of 370 students 
each. Those 197 items which yielded correlati 


ations With the total R-S score 
significant at the .001 level or better in both Samples were retained as 
scorable items in the revised R-S scale. This revised scale was found to 
have a corrected split-half reliability of .94 and a test-retest reliability 
(3 months) of .82. 


The following section surveys the published and unpublished re- 
search in which these MMPI repression-sensitization Measures have been 
used. Approximately 97% of Altrocchi’s items were retained in the 
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original and about 70% in the revised R-S scale. The Facilitation-Inhibi- 
tion Scale has been found to correlate .76 with the R-S scale in one 
student population (Byrne, 1961), .88 in another (Liberty, Lunneborg, 
and Atkinson, in press), and .94 in a neuropsychiatric patient population 
(Ullmann, 1962). Almost half of the items in the F-I scale also are con- 
tained in both the original and revised R-S scales. Therefore, it seems 
defensible to consider the instruments as interchangeable measuring 
devices. 


II. A Review of Research on the R-S Scale 


A. REPRESSION 


Basic to the construct validity of the R-S scale is evidence relating 
scores on the test to repressive versus sensitizing behavior. As with most 
variables in the field of personality, the conceptualizations regarding 
defensive modes range from nonoperational theoretical formulations 
based on clinical observations to quite specific operations devised for 
h purposes. Since no single variable of the latter variety can en- 
compass all of the surplus meaning contained in the clinical theories 
of repression, the construct validity of the R-S scale must rest on a series 
of correlational and experimental findings. Seven different types of evi- 


resea 


dence have been reported. 


1. Judgments Concerning Defense Mechanisms 

Dr. Vincent Tempone compared the R-S scale with judgments by 
clinicians of the meaning of this variable. He had nine staff members at 
the Massachusetts Mental Health Center fill out the test as they thought 
à repressing individual would. He defined agreement as concurrence by 
at least seven of the nine judges. On the original R-S scale, judges corres- 
ponded with the key on 7276 of the items, and on the revised scale there 
Was correspondence on 90%. On none of the items did the judges agree 
On a response opposite to the key (Byrne et al., 1963). 

A second investigation by Tempone (1963) utilized ratings of 18 
therapy patients by six first-year psychiatric residents. Patients’ scores 
on the R-S scale and clinical ratings of their repression-sensitization were 
found to correlate .38, which falls short of statistical significance in this 
small sample. 

Davison (1963) administered several measures of coping mechanisms, 
including the R-S scale, to 48 male students. A cluster analysis of the 32 
variables led to the identification of three clusters. The first was essentially 
defined by the R-S scale, and it included (in addition to the MMPI 
scales used in constructing the R-S scale) the F, Pd, and Ma scales from 
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the MMPI, intellectualization as judged in 


an interview situation, and 
Identification on the Heath Phrase 


Association Test. 
2. Selective Forgetting 

At the University of Arkansas, Gossett (1964) attempted to elicit 
repressive mechanisms by combining threatening conditions with - 
memory task, "The subjects consisted of 48 repressers and 48 sensitizers 
drawn from a pool of 296 introductory Psychology students. Each subject 
learned a list of 12 nonsense syllables. Their performance on the filth 
trial constituted the first recall test, Following this, they were given an 
“Intelligence and Personality Test” made up of 12 subtests, each titled 
with one of the nonsense syllables. In the experimental group, failure 
was induced via false norms, and the failures were attributed to certain 
intellectual and/or personality problems. The control group took the 
test asa “standardization sample” without false norms or induced failure. 
After a filler task, all subjects were asked to recall the 12 syllables. As a 
control for Suppression, half of the subjects were offered rewards for 
remembering correctly ($10 and the chance to avoid several hours of 
tedious tasks) while half were given standard instructions to attempt £o 
remember the syllables, After an explanation of the purpose of the exper! 
ment to the subjects, a third recall test was given. Gossett found signi- 
ficant differences between repressers and sensitizers in their recall of the 
syllables when the effects of Suppression were controlled, He interpreted 


this finding to mean that those with low Scores on the R-S scale repress 
threatening material whi vith high scores do not. 


McReynolds and Ullmann (1964) with 40 students in an advanced 
psychology class obtained recall data for 36 words Projected briefly on а 
screen immediately after the last word had been presented, Of the 30 
nonpractice words, 12 were pleasant, 12 were unpleasant, and 6 were 
neutral. The recall score consist. E ifference between the number 
of unpleasant and pleasant words recalled. With anxiety, number of 
neutral words recalled, and general tendency to use pleasant or unpleasant 


words held constant, the correlation between Scores on the F-I scale and 
the recall score was not significant. 


3. Perceptual Defense 


Considering the origin of the conceptualizations of repression- 
sensitization, it is obvious that scores on the В.5 scale would be expected 
to be related to differential perceptual recognition thresholds for threat- 
ening versus nonthreatening stimuli. Tempone (1965 Я 
tion, utilizing an induced failure manipulation as 


Й the source of threat. 
From a pool of 244 University of Texas undergr. 


aduates, he selected 40 
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repressers (below 25th percentile) and 40 sensitizers (above 75th per- 
centile) on the basis of scores on the R-S scale. On an eightitem ana- 
grams task taken with a group of stooges, half of the subjects were given 
a failure experience (two or less anagrams correct) and half a success 
experience (six or more anagrams correct). The correct solution to each 
anagram was given to the subjects after each trial. Following this phase 
of the experiment, a perceptual test was given in which 19 words were 
presented at gradually increasing exposure speeds tachistoscopically; the 
stimuli consisted of three practice words, the eight correct anagram solu- 
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Fic. 1. Perceptual recognition thresholds for critical words under success and 
failure conditions. [After Tempone (1962).] 
tions, and eight neutral words. The order of the latter 16 words was 
counterbalanced, and the critical and neutral words were matched for 
word length and frequency based on the Thorndike-Lorge word count. 
Word thresholds were measured as number of trials required for correct 
recognition. Analysis of variance indicated a significant interaction effect 
for repression-sensitization times success-failure. Repressers and sensitizers 
did not differ significantly in the success condition, but repressers ob- 
tained significantly higher thresholds for the threat words in the failure 
condition than did sensitizers. For the neutral words, thresholds were not 
related to defensive mode or experimental conditions. The relationship 
is shown graphically in Fig. 1. 

Ullmann, Weiss, and Krasner (1963) were able to change the per- 
ceptual performance of repressers by means of a process of verbal condi- 
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tioning for use of emotional words just prior to responding to a pe 
tual defense task. The task was the recognition of matched neutral anc 
threatening words on a series of successively better carbon — 
Compared to the control group, inhibitors in the experimental apu 
showed significantly less difference in their recognition thresholds 20 
threatening and neutral words; facilitators in the two groups did not shon 
a significant difference, The change evidenced by those with wi iet 
defenses could be interpreted as a function of increased familiarity ien 
the threatening words, desensitization to the threat represented by such 


н : acceptable 
words, or as learning that the use of such words was socially accepta 
in the context of the experiment. 


1. Awareness of Anxiety 


In his dissertation research at the University of Rochester, Pomeranz 
(1963) investigated the relationship between verbalized response to — 
and repression-sensitization, Three types of movies (control, stressor, anc 
alleviation of emotional arousal) were shown, and the subjects' affective 
states during each were obtained on an Adjective Check List and a 
Anxiety Differential measure. "The subjects were 63 male Mg pF арче 
selected from a larger group on the basis of R-S scale scores, It was found 
that sensitizers indicated significantly more emotional arousal in керне 
to the stressor (movie of unpleasant surgical operation) than did re- 
pressers, 

Byrne and Sheffield (in press) 
exposing subjects to 11 vividl 
novels. Subjects were 


created a potential threat situation by 
Y descriptive sexual passages from various 
all males with 22 sensitizers and 22 repressers 
serving in the sex arousal condition, In the control group 22 sensitizers 
and 22 repressers read neutral descriptive passages from the same 11 
novels. Immediately after the reading task, all of the subjects were asked 
to fill out six five-point rating scales dealing with their feelings while 
reading the literary material, On the “Sexually Aroused" scale, ratings 
were significantly higher in the experimental than in the control group 
for both repressers and sensitizers, On the “Anxious” scale, however, there 
was a significant interaction in that repressers d 
differ in the neutral condition, but sensitizers rated themselves as sig- 
nificantly more anxious in the experimental condition. The relationship 
is shown in Fig. 2. A possible interpretation of this finding is that repres- 
sers were unaware of or were denying anxiety as a con- 
comitant of sexual excitement. It would be important, therefore, to be 
able to find some other evidence of disturbance accompanying the denial 
of anxiety. With experimental and control BroUps combined, self-ratings 
of sex arousal correlated with self-ratings of disgust (r= .84, p < .05) and 


and sensitizers did not 


any feelings of 
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anger (r = .36, p < .05) for repressers. For sensitizers, sex arousal was 
significantly related to anxiety (r = .67, p «.01) lack of boredom 
(т = —.57, p < .01), and feeling entertained (r = .33, р < .05). 

Relevant data are also presented by Lomont (1964) in an investiga- 
tion utilizing 35 hospitalized psychotic and nonpsychotic patients. During 
their first month of hospitalization, the subjects were given the R-S 
scale, a 66-item word association test containing both threatening and 
nonthreatening words, and the IPAT Self Analysis Form. The word 
association test was scored for 31 signs of disturbance including reaction 
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Fic. 2. 
material. [From Byrne and Sheffield (in press).] 
time over 2.5 seconds, blocking, and reproduction failures. A significant 
positive correlation was found between R-S scale scores and IPAT scores 
(т = .76) and a significant negative correlation between the R-S scale and 
disturbance scores on the word association test (r = —.45). Thus, sensi- 
Lizers responded with greater verbalized anxiety than repressers on a self- 
asure while the repressers indicated greater disturbance than 


report me 
sensitizers on the indirect measure. 


ЕА Physiological Response to Threat 

Lazarus and Alfert (in press) report an investigation in which cog- 
nitive appraisal of a stress situation and physiological response to that 
Stress were compared. In previous research (Lazarus, Speisman, Mordkott, 
and Davison, in press: Speisman, Lazarus, Mordkoff, and Davison, in 
press), a silent movie of a primitive subincision ritual was used to produce 
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stress reactions. In the Lazarus and Alfert study, three experimental con- 
ditions were employed: the silent film without introduction or com- 
mentary, the film with both an introduction and a movie sound track 
that emphasized denial and reaction formation, and the silent film with 
the denial and reaction formation introduction. The latter two condi- 
tions were conceptualized as possibly alternate Ways to reduce the threat 
of the film; there was denial of the painful aspects of the operation and 
of its possible physical harm and an emphasis on the joy of the native 
boys in participating in the ceremony. The subjects were 69 male students 
at the University of California. During the movie, measurements of skin 
conductance and heart rate were made continuously, Immediately after 
the film, several Psychological measures were obtained including the 
Nowlis Adjective Check List of Mood (Nowlis and Nowlis, 1956). In a 
Separate research session, the subjects were given the MMPI which was 
scored for K, Hy denial, Welsh Repression, and Repression-Sensitization. 
In general they found on both the physiological measures and mood 
ratings that the silent film condition was the most stressful and the silent 
film plus denial introduction the least stressful, Scores on the four per- 
sonality variables were divided at the median. In the various comparisons 
which were made, the K and Hy denial scales were the most effective 
predictors of individual differences. The К-5 scale did not yield as many 
significant findings (possibly because of à median split rather than the 
use of extreme groups), but it followed the same general pattern as the 
other variables. The over-all tendency was for those individuals with 
denying and repressing defenses to show higher levels of skin conductance 
and lower levels of discomfort on the verbal measures when compared 
with subjects with sensitizing defenses. The skin conductance data for 
the R-S scale are shown in Fig. 3. Even though Lazarus and Alfert found 
that those with low Scores on the R.S scale indicated less anxiety and 
depression on the Nowlis Mood Scale than did those with high scores, 
the reverse pattern was found for skin conductance, The authors con- 
clude: 
"Judging from these Nowlis patterns, verbally derived measures of 
stress response in the form of disphoric affect interacts with personality 
variables in a direction Opposite from what is found with autonomic 
indicators. High deniers refuse to admit disturbance verbally but reveal 
it autonomically, while low deniers are apt to say they are more disturbed 
while showing less autonomic reactivity," 


In his doctoral research at the University of California, Davison 
(1963) also utilized the subincision ritual Movie as a stressor. A group 
of 48 male undergraduates viewed the film weekly for a total of three 
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exposures. On each occasion, recordings of several physiological responses 
were obtained. Scores on the repression-sensitization dimension predicted 
reaction to the film in terms of skin conductance, heart rate, and bodily 
movement. Dividing the subjects into high, medium, and low groups, he 
found that the medium group was the most physiologically responsive to 
the film while the sensitizer group was the least responsive. A possible 
implication is that the development of strong consistent defenses of 
either type protects the individual from physiological stress, but that 
sensitization is more effective in this respect than repression. The 
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Fic. 3. Changes in skin conductance from baseline to least threatening and most 
threatening film conditions. [From Lazarus and Alfert (in press).] 


greatest differences were between the sensitizer group and the other two; 
Davison (1963, p. 123) suggests, 

“We have some support for the construct validity of the Byrne scale 
as a measure of sensitization in two facts: (1) the high sensitizers show 
least movement during the film and our previous findings associate 
immobility in this situation with attention; (2) the high sensitizers obtain 
the largest first-session mean on Nowlis "Activation," which on its con- 
tent would seem to reflect self-reported mental alertness and interest, 
and they demonstrate a progressive decline in this dimension. In other 
Words, they appear to be reacting to the first exposure to the film, a 
threatening situation, with mobilization of their preferred coping 
mechanisms, and this response diminishes as the film loses its threatening 


capability." 
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6. Responding to Ambiguous Stimuli 


Responses to relatively 
been found to be in part a fu 
mechanisms. Those w 
with neutral or nonth 


ambiguous or unstructured stimuli have 
nction of the subjects' characteristic esi 
ho repress tend to respond to ambiguous caliente 
reatening interpretations, while individuals enpor 
ing intellectualizing or obsessional defenses respond to such material wit ; 
conflict-laden and emotional content. Are scores on the R-S scale zaa 
to these differences? If repression-sensitization scores indicate a pona 
tendency to avoid or approach threatening stimuli and if sexual anc 
aggressive cues represent some degree of threat to almost everyone in Qus 
culture, interpretation. of Projective material as sexual or aggressive 
should be a function of R-S scale scores. 

From a group of 213 undergraduates, 29 sensitizers and 24 терезе" 
were asked to respond to nine TAT сг rds (Byrne, 1961). Each e^ 
Was scored for sexual content, aggressive content, and percentage aa 
emotional words, Neither the aggression score nor the percentage О 
emotional words was related to repression-sensitization. For both SEXE 
sensitizers had higher sexual Scores than repressers. The difference for 
males was significant at beyond the .01 level. The females had a larger 
mean difference than the males, but with a smaller N and a larger vari- 
ance the difference did not reach statistical significance. 


A TAT investigation by Tempone (1963) yielded quite different 
results. With 46 male undergraduates at the University of New Hampshire 


as subjects, stories given in response to five TAT cards were scored fo! 
aggressive and sexual content. In this stud 


(b < .05) more aggressiv 
sexual responses were n 


Y, sensitizers gave significantly 
€ content than Tepressers, but the differences 10 
ot significant. 

In a word association study, Blaylock (1963) utilized a 100-item list 
made up chiefly of homonyms with aggressive and neutral meanings. А 
group of 37 Texas undergraduates Was given this test and the R-S scale. 
Repression-sensitization was found to correlate .37 (p < .05) with number 
of stimulus words perceived as aggressive. When this investigation was 
repeated with a similar word association test administered individually, 


no relationship to R-S scale Scores was found in a group of 54 under- 
graduates. 


7. Response to Humor 


On the basis of Freudian theory, 
hypothesized differential response to hu 
repressing versus sensitizing defenses. 
individuals with low appreciation for | 


O'Connell and Peterson (1964) 
morous material as a function of 
Specifically, they predicted that 
1umor are rigid repressers, those 
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with high appreciation for humor are normals (middle part of the R-S 
scale), and that sensitizers should fall in between the other two groups in 
humor appreciation. The humor test was composed of 17 jokes and 
anecdotes which had been unanimously classified by six clinical psychol- 
ogists as falling in the category of humor (rather than aggressive or 
hostile wit or nonsense wit). In defining these categories for the judges, 
the authors indicated that “. . . humor is characterized by an objectively 
stressful situation in which the principal reacts with a jest which is 
relatively free of hostility directed toward others.” For example, one of 
the humor items was, “A condemned prisoner was being led to the 
gallows early one Monday morning. As he left his cell, he waved to the 
other prisoners and said, ‘Well, this is a good beginning to the week.’ " 
In its final form, the humor test required the subjects to respond to each 
item on a five-point scale ranging from “dislike very much” to “like very 
much”; its corrected split-half reliability was found to be .96. 


TABLE 11 
REPRESSION-SENSITIZATION AND HUMOR APPRECIATION? 
R-S scale 
uu E 
Humor appreciation N M SD 
High 54 39.20 17.82 
Medium 70 42.20 18.36 
Low 54 36.10 17.42 


a After O'Connell and Peterson (1964). 


Both the humor test and the R-S scale were administered to 178 
students at Baylor University. The sample was divided approximately 
into thirds on the basis of their humor appreciation scores. As shown in 
Table II, the mean repression-sensitization scores for these three groups 
fell in the predicted order. A simple analysis of variance did not yield a 
significant F, but group comparisons by means of t tests indicated the 
low-appreciation group Was significantly more repressing than either the 


middle- or high-appreciation groups. 


$. Conclusions 

Predictions based on the assumption that scores on the R-S scale 
indicate individual differences in the tendency to repress or deny or 
avoid threatening stimuli have been relatively well supported. There is 
evidence that behavior which clinicians define as repressive is related to 
the behavior measured by the test. When confronted by threatening or 
provoking situations, individuals on the two ends of the scale 
licted direction in terms of memory, perception, and 
these differences are not manifest in neutral 


anxiety-] 
differ in the prec 
reported anxiety; further, 
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: : = Е - ressers 
Situations. It has not yet been conclusively demonstrated that owe a 
j У : f М : ; eel- 
give physiological evidence of anxiety while verbally denying such " 
2 са ег 

ings, but the work of Lazarus and others offers some encouragem 


NS dade а E zonally dis- 
along this line. Response to projective material with emotionally 
turbing content is not consisten 


is possible that the inconsisten 
function of the degree of thre 
hypothesized, therefore, that 
and/or the accompany 
sufficiently threatening, 
to differences in respo. 
repressers as com 


tly related to repression-sensitization. It 
t results in the latter work may be а 
at provided by the stimuli. It would n 
if the content of the stimulus materia 
ing instructions and experimental setting er 
differences in repression-sensitization would sA 
nse. Finally, the relative dislike of humor b 
pared with other individuals is consonant with Freudian 
humor theory. In their excellent review of the general concept of repres- 
sion, MacKinnon and Dukes (1962) describe three experimental ap- 


тот А " А : stem, 
proaches to the study of this behavior, Within that categorization syste 
the R-S scale has been found relev 


ant in predicting (1) inhibition of 
perception, (2) inhibition of memory, and (3) inhibition of response 
The construct validity of the scale as a measure of differences in defenses 
seems moderately well established. 


B. MEASURES or SELF-DESCRIPTION 


On several bases it has been hy 
is related to measures of the self 
end of the continuum w 
of themselv 


pothesized that repression-sensitization 
concept. Individuals on the repressing 
ould be expected to be unaware of those aspects 
es which involve anxiety-arousing cues. Thus, in response р 
the typical assessment devices utilized in research on self theory, repressers 
should be less likely than sensitizers to indicate negative self attributes 
or to indicate discrepancies between their actual and ideal attributes 
Sensitizers, in contrast, would be expected not only to be aware of 


negative attributes and self-ideal discrepancies but also to dwell on them 
and even to exaggerate them. 


1. Self-Ideal Discrepancy 


A number of earlier investigations 
viduals tend to describe themselves more positively on Q-sort tasks than 
do sensitizing individuals (e.g, Block and "Thomas, 1955; Chodorkoll, 
1954). With the R-S scale, the self-ideal discrepency of repressers on 
Leary's Interpersonal Check List was found to be significantly less than 
that of sensitizers in two different samples of student nurses (Altroccht 
et al., 1960). The magnitude of the relationship is a relatively substantial 
one; on Worchel's (1957) Self-Activity Inventory (SAT) the R-S scale has 
been found to correlate with self-ideal discrepancy scores -62 (p < .01) and 


reported that repressing indi- 
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55 (p«.01) in two groups of undergraduates (Byrne, 1961) and .63 
(p < .01) in a third sample (Byrne et al., 1963). The self-ideal discrepancy 
scores on Bill's Index of Adjustment and Values and the R-S scale were 
found to correlate .61 (p < .01) in a group of 227 nursing students in 
New Jersey (Hanson, 1963). 


2. Self Concept 

Presumably, sell-ideal discrepancy can come about as a function of 
variations in either self concept or ideal concept. It is reasonable to 
inquire, then, whether represser-sensitizer differences in discrepancy are 
a function of differences in descriptions of self, ideal, or both. Altrocchi 
et al. (1960) reported that repressers and sensitizers did not differ in ideal 
self, only in self concept. Sensitizers were found to have a more negative 
self concept and hence a greater discrepancy between self and ideal. 
Sensitizers described themselves as more rebellious, aggressive, and self- 
effacing, and as less dominant than repressers. 

These findings were verified in subsequent research using the SAI. 
Each item on the latter was rated in terms of whether or not the behavior 
represented a negatively valued characteristic. On this basis, both self- 
descriptions and ideal-self descriptions could be scored in terms of 
negativeness. In two different investigations (Byrne, 1961; Byrne et al., 
1963), repression-sensitization was found not to be significantly related to 
negative ideal-self descriptions (r = .25; т = .16) but substantially related 
to negative self-descriptions (r = 66, p < .01; т = .68, p < .01). It seems 
clear that the findings regarding repression-sensitization and self-ideal 


discrepancy are a function of differences in self-descriptions. Repressers 
and sensitizers do not differ systematically in their ideals. Rather, it is 
their self-pictures which differ markedly, with repressers presenting them- 


selves positively and sensitizers negatively. 


3. Incongruency 

A somewhat different sort of self-descriptive measure is that of 
incongruency or inconsistency of percepts. As has been discussed else- 
where (Byrne et al., 1963, p. 330), | 

“McReynolds (1956) has proposed that anxiety is a function of the 
quantity of perceptual material not assimilated into perceptual systems. 
He uses the term "incongruency" to describe unassimilated material. 
Incongruent percepts are those which are difficult to harmonize because 
they are contradictory, inconsistent, or dissonant (McReynolds, 1958), 
and the existence of such percepts motivates individuals to reduce incon- 
a slightly different terminology, а stimulus is incongruent 


gruencies. In à a к 
if it elicits incompatible responses. The concept of 


for an individual 
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е " r LES TT " МИР Y ER cogni- 
meongruency is quite similar to Festinger's (1957) formulation of cog! 
tive dissonance.” 


Я Е ГИР Pe y in 
Incongruency has been operationally defined as inconsistency ih 
г - А islike) w 
self-report between values (good-bad) and feelings (like-dislike) 


incongruency, several researchers have constructed tests consisting of а 
series of items which take the form “Huntin 
"Insisting on my rights." Ss respond 
rating each activity as good or bad. L 
in terms of feelings by rating each 
Each item is then scored for 


; " and 
g for game with a gun cv 
to the items in terms of values 2 
; n 
ater, they respond to the same ite 


of ratings. A congruent response (Good-Like or Bad-Dislike) тесегмез T 
points. An incongruent response (Good-Dislike or Bad-Like) receives оле 
point. The total incongruency Score is the number of items to which s 
has responded in ап incongruent Way. Among other findings, eel 
&ruency has been found to be positively related to Scores on the Mone 
Anxiety Scale (McReynolds, 1958). Also, incongruency with respect t 

hostility was found to be inversely related to the accuracy of pau. 
of hostile cartoons while sexual incongruency Was found to be inversely 
related to the accuracy with which sexual cartoons are identified (Byrne, 
Terrill, and McReynolds, 1961). 


- м : n 3 А А uld 
Since Incongruency is conceptualized as anxiety arousing, it мо 


me pts 
follow that repressers should be less aware of their incongruent perce} 
than are sensitizers, 


А А е5 
In extending the mcongruency concept, three typ 
of incongruency were investigated, A i 


used, and subjects rated each on the b 
their values of good-bad, 
sequences as pleasant-unpleasant, Thus, thr 
possible: feelings versus values (like-bad, dislike-good), values versus con- 
sequences (good-unpleasant, bad-pleasant), 
quences (like-unpleasant, dislike-pleasant), A 


was given the R-S scale and the Hosti ity Incongruency Test. Each type of 
incongruency was significantly related to repression-sensitization: .98 
(p < .01) for feelings versus values, .33 (p< 01) for feelings versus con- 


sequences, and .19 (p < .05) for values versus consequences, Further, in 
each of the former two instances, one of the two possible types of incon- 


ips. R-S Scale scores cor- 
(т = 40, P < .01) but not 


and Tepression-sensitization 
correlated with number of items checked like-unpleasant (r = 45, p«.01) 
: а 


but not with items checked dislike-pleasant (r = -—03)- Sensitizers, then, 
А more likely to indicate that they enjoy behavior which they believe to 
аг 


of environmental con- 
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be morally wrong and which they believe is likely to have unpleasant 
environmental consequences. 


+. Hostility 

Using the Rosenzweig P-F test and a 14-item rating scale as hostility 
measures, Altrocchi, Shrauger, and McLeod (1964) found that repressers 
attributed less hostility to themselves than did sensitizers or expressors. 
In research which is as yet unpublished, Lomont (1963) investigated 
the relationships among the R-S scale, a self-report measure of hostility 
(the Buss-Durkee Inventory), and hostility as measured by the Holtzman 
Ink Blot Test. With 27 college students, repression-sensitization cor- 
related .74 (p < .01) with scores on the Buss-Durkee scale but only 
negligibly with hostility as measured by the ink blots. A more unusual 
finding, however, had to do with the fact that repression-sensitization acts 
to obscure the relationship between the two hostility variables. The 
latter do not correlate significantly (r = .28) unless scores on the R-S 
scale are partialled out; in the latter case, the two hostility measures 
correlated .49 (р < .05). Lomont concludes: 

“These results, I think, are nicely consonant with the clinical assump- 
tion that an ink blot test can tap unacknowledged hostility. In keeping 
with the implications of the clinical belief, repression appears to be an 
intervening variable which affects the correspondence between self- 
reports of aggression and aggression measures on an ink blot test. This 
picture of repression as an intervening variable implies that, given ‘a 
certain degree of self-reported aggression, the degree of hostility which 
shows up on an ink blot test is positively related to the degree of 
repression operating on the self-reported aggression. ... The results fit 
the clinical belief regarding the relation of repression to self-reported and 
ink blot aggression particularly well since R-S is highly related to self- 
reported aggression but not at all with ink blot aggression. This finding 
is in keeping with the clinical hypothesis that repression censors an 
individual's self-report of aggression but that the ink blot measure of 


aggression largely circumvents censorship. 


5. Conclusions 

In the research conducted to date, the relationship between R-S 
scale scores and measures of self-description is clearly and firmly estab- 
lished. The higher an individual's score on the R-S scale, the more likely 
he is on other instruments to indicate a discrepancy between his self 
concept and his self ideal, to describe himself in negative and self- 
depreciating terms, to report incongruencies among feelings, values, and 
environmental consequences, and more specifically to say that he enjoys 
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` . > «oh is likely to 
behavior which he believes to be morally wrong and which is likely 
have unpleasant consequences. Further 


increz the 
» as R-S scale scores increase, 
more likely an individual 15 to descril 


5 К feelings 
be his own behavior and feelit ji 
nt diflerences in the ied Ы 
т most unfavorable possible lig 


C. MALADJUSTMENT 


If each end of the repression-sensitiz Id 
extreme of the respective defensive modes, scores on the R-S scale wou f 
be expected to have a curvilinear relationship with various indices % 
psychological adjustment. Neither overintellectualization of conflicts no 
denial of them should result in optimal adjustment. 


; ; cenresents iati 
ation continuum represen 


1. Population Comparisons 


Because of the multitude of differ 
adjustment, comparison of hospit 
college students provides equiv 


: е : al- 
€nces involved besides that of e 
à Rees ants wi 
alized neuropsychiatric patients ee 
i " n 
ocal evidence at best. Nevertheless, t 


TABLE III 
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ALCOHOLICS A ND Normarse 


Normals 


Alcoholics Mean 
N M SD N M SD difference ch 
b 
Males 394 6308 17.71 69 1677 12.50 pu 
Females 230 61.80 16.20 11 16.29 2.99 
a Data from Gynther (1963). 
v p< 001. 


e p< 01. 


curvilinear hypothesis leads 


to the prediction th 
patients should have more ext 


-hiatric 
at neuropsychiatr! 
reme scores on 


the repression-sensitization 
measure and hence a larger standard deviation than normal subjects. 


Ullmann (1962) compared scores on the Е scale for 47 male Texas 
students (M = 29.38, SD = 6.50) with those of two samples (N = 90 and 
64) of male neuropsychiatric patients at the Veterar 
Hospital at Palo Alto (M = 25.74 and 25.39, sp = 11. 
patients had а significantly higher standard dev 
as predicted. In addition, mean differences w 


ns Administration 
22 and 11.44). The 
lation than the students, 


Пйегепс еге а]зо Significant; the 
patients were further toward the facilitating or sensiti 


continuum. 

In a group of 35 undergraduates, Joy (1963b) found 
9] between scores on the R-S scale and Scores on an M 
MM Alcoholic Tendency. Because of item overl 


a correlation of 
MPI scale which 
ap on the two in- 
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struments, the meaning of this finding is open to question. Confirmation 
of the relationship between sensitizing defenses and alcoholism has been 
provided, however, by Gynther (1963) at the Malcolm Bliss Mental 
Health Center in St. Louis. As part of an investigation of married and 
unmarried alcoholics which is now in progress, Dr. Gynther obtained 
R-S scale scores on 69 male and 11 female alcoholics. As shown in 
Table III, for both sexes the alcoholics are significantly more sensitizing 
on the original R-S scoring system than the college students in the 


normative sample. 


2. Deviation from the Norm 


In his work on the Deviation Hypothesis, Berg (1959) has presented 
evidence to suggest that responses to test items which differ from those of 
the total group are indicative of abnormality. He has stated the Deviation 


Hypothesis (Berg, 1957, p. 159) as follow 
erns tend to be general; hence those deviant 


“Deviant response patt 
behavior patterns which are significant for abnormality (atypicalness) 
and thus regarded as symptoms (earmarks or signs) are associated with 
other deviant response patterns which are in noncritical areas of behavior 
and which are not regarded as symptoms of personality aberration (nor as 
indicators, signs, earmarks)." 

The tendency to respond deviantly, then, should be more character- 
and sensitizers than of those falling in the middle of 
djective check list to measure deviancy was devised by 
1960) by determining the 33 most commonly and 39 
adjectives оп Gough's Adjective Check List. 


istic of repressers 
this dimension. An a 
Grigg and Thorpe ( 


least. commonly checked хевир 
Deviant scores consist of the number of adjectives of the first type not 


checked and the number of the latter that are checked. With two in- 
dependent samples of undergraduates who had taken the R-S scale, 
deviant response scores on the Grigg- Thorpe list were obtained (Byrne, 
1961). For one group the correlation was 42 (p < .01) and for the other 
-33 (p < .01) with no evidence of curvilinearity. Sensitizers tended to make 
deviant responses while repressers tended to make modal responses. It 
may be observed, though, that the commonly checked adjectives are 
primarily positive attributes and the least commonly checked adjectives 
negative attributes. If the adjective check list were viewed as simply a 
self-description measure, an alternative interpretation of the findings is 
that they represent an additional instance of sensitizers and repressers 
differing in the extent to which they describe themselves in positive or 


negative terms. 
Further evidence on this point has been reported by Lucky and 
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Grigg (1964). Two measures of deviant response tendency were sape. 
one which involved self-description (the Grigg-Thorpe adjective chec 

list) and one which did not. The latter consisted of an “ESP” task E 
which subjects indicated for each of 72 items whether they believed т 
the experimenter wanted them to mark it; deviancy scores were defined " 
the number of items that deviated 1.5 SD's from the mean number » 
items circled by the group. Again, a positive linear relationship was found 
between R-S scale scores and deviant response scores on the adjective 
check list (r = .40, b < .01). With the ESP task, however, no ana a 
to deviancy was found (т = .07). It would appear that existing data simp y 
support the relationship between Tepression-sensitization and self descrip 


à р : : ANON : ы 
Поп with no evidence of а relationship with deviant response tendency 
per se. 


d Anxiety 


Since anxiety is given a prominent oA 
psychological malfunctioning, a relationship between repression-sensitlza" 
tion and anxiety measures would be hypothesized. Extreme sensitizers 
might be expected to verbalize feelings of anxiety readily while tnus 
ling the emotional experience itself while extreme repressers should 
avoid both the verbalization and the subjective experience. Presumably, 
only the verbal expressions of anxiety are amenable to investigation. 

Ullmann and McReynolds (1963) reported 
of —50 between scores on the F-I scale and ward ratings of anxiety on 53 
neuropsychiatric patients, As predicted, anxiety is more characteristic of 
those on the facilitating or sensitizing end of the scale. As noted earlier, 


the R-S scale and the IPAT Self Analysis Form were found to correlate 
.76 ina group of 35 hospital patients (Lomont, 1964). 


Instruments such as Taylor's Manifest Anxiety Scale and the Welsh 
Anxiety Scale are built from MMPI items just as the R-5 and F-I scales 
are; further, there is even item overlap. Given these. circumstances, а 
substantial relationship between Tepression-sensitization and anxiety 
could hardly be avoided. Ullmann and McReynolds (1963) reported 
correlations between the F-I scale and Welsh Ans 
psychiatric patients and —.85 for college students, Similarly, Joy (1963b) 
found a .91 correlation between the R-S scale Ў 


Я and the MAS with under- 
graduates. Not only are sensitizers found to be more anxi 


on the repression-sensitization measure are about as highly correlated with 
these two anxiety scales as is possible with the i 


nstruments’ reliabilities. 
A few of the implications of this somewhat disturbing finding are dis- 


cussed in the final section of this chapter. 


SA y 6 st 
position in the dynamics of mo 


vd ion 
à significant correlatior 
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4. Emotional Stability and Social Approach-Withdrawal 


Weinberg (1963) carried out a factor analytic investigation of 37 
variables in which the R-S scale was included along with word association 
scores, field dependency measures, and several personality scales. The 
subjects were 62 outpatients at the UCLA Medical Center Clinics. Ten 
factors were obtained; the first four (accounting for 70% of the variance) 
were labeled Emotional Stability versus Instability, Word Association 
Style, Field Dependence, and Social Approach-Withdrawal. The R-S 
scale loaded substantially on the first and fourth of these factors. Also 
loading on Factor I with the R-S scale were Impulsivity, Cyclothymia, 
Thinking Introversion, Ma, Pd, and two depression scales, while the 
Opposite pole was represented by Achievement via Conformance and 
Responsibility. On Factor IV at the same pole as the R-S scale were 
Social Introversion, Depression, and sleeplessness while extraversion and 


rhathymia loaded in the opposite direction. 


5. MMPI Variables 

As with the anxiety measures, relationships between the R-S scale 
and other MMPI scales are complicated by overlapping items, identical 
test format, and so forth. In spite of these confounding factors, the 
findings reported by Joy (1963b) are of interest. With a group of 35 
Texas undergraduates, he obtained significant correlations with a number 
of MMPI scales as shown in Table IV. Also shown are significant cor- 
relations found by Endler (1963) with a group of 42 undergraduate males 


TABLE IV 


CORRELATIONS BETWEEN R-S SCALE AND MMPI SCALES 


т 


Joy (1963b) Endler (1963) 


MMPI Diagnostic Scales 


Depression 72 66 
Hysteria —53 01 
Masculinity-Femininity 50 — 
Psychasthenia 60 92 
Social Introversion а = 
Hypochondriasis — 25 52 
MMPI Validity Scales 
Lie —.61 —38 
Е 69 S 
K —91 —.76 
MMPI Special Scales 
Neuroticism 75 -- 


Ego Strength 
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at York University. As a brief and tentative summary, the two investiga- 
tions suggest that sensitizers, compared with repressers, tend to respond to 
the test as do members of the Opposite sex, to give unusual or uncon- 
ventional responses, to be depressed and discouraged to be anxious and 
agitated, to be socially introverted, and neurotic. In additio compared 
with repressers, sensitizers tend to | 
to have less ego strength. R 
adjustment, sensitizers tend t 
than repressers. 


be more cynical and self-debasing and 
ather than a curvilinear relationship with 
о present a less emotionally healthy picture 


6. CPI Variables 


ise c eget Inventory (CPI) (Gough, 1957) was 
R A ; mesi i variables. In contrast to the psycho- 

slat ation of the MMPI, the CPI centers on areas of behavior 
relevant to a normal population in four a 5: Poi А E lancy, and 
Self-Assurance; Socialization, Maturity, and Reano clean nent 
Potential and Intellectual Efficiency; and Intellectual and | t ; ac Modes 
On each variable, high scores indicate better alumen Шоп the tie 


TABLE V 


RErnzEssION-SE. IZATION AND CPI V. 


АМАВ 


r 


Byrne, Golightly, and 
.— Joy (19636) Sheffield (in press) 
35 Undergraduates 


Dominance = 43 Males 48 Females 
Capacity for Status m =R .16 
Sociability x —.29 
Я , —.66a 
Social Presence MULA —.320 
Self-Acceptance —33 —.24 
Sense of Well-Being — 802 —.08 
Responsibility —370 — 43a 
Socialization — 380 —.14 
Self-Control —.60a —22 
"Tolerance — 78a —.38b 
Good Impression —.67a —45a 
Communality —03 —.39a 
Achievement via Conformance —79а mt 
Achievement via Independence —.55а 77464 
Intellectual Efficiency —.55a —330 
Psychological-Mindedness —.41b m 
M pe A ~21 
Flexibility —17 — 94 
Femininity .38b a 
а p< Ol. 


d p< 05. 
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Two investigations have included the correlation of each CPI scale 
with the R-S scale in groups of undergraduates. Joy (1963b) utilized 35 
undergraduates of both sexes while Byrne, Golightly, and Sheffield (in 
press) obtained samples of 43 male and 48 female undergraduates. The 
coefficients are shown in Table V. As may be seen, all three samples 
yielded significant correlations between repression-sensitization and seven 
of the 18 CPI scales: Sociability, Sense of Well-Being, Self-Control, 
Tolerance, Good Impression, Achievement via Conformance, and Intel- 
lectual Efficiency. On the basis of the CPI manual, repressers and sensi- 
tizers would tend to be differentially described by others as in Table VI. 


TABLE VI 


Sensitizers 


Repressers 
1. Active, alert, ambitious, com- 
petitive, energetic, enterprising, 
forward, industrious, ingenious, 


productive, progressive, quick, re- 
sourceful, strict and thorough in 
their own work and in expectations 
for others, and valuing work and 
effort for its own sake 

2. Broad and varied interests, capable, 
clear-thinking, high value on cog- 
nitive and intellectual matters, 
intellectually able, intelligent, origi- 
nal and fluent in thought, valuing 
intellectual activity and intellectual 
achievement, verbally fluent, versa- 
tile, and. well-informed 

3. Calm, conscientious, deliberate, dili- 
gent, efficient, honest, inhibited, 
organized, patient, persistent, plan- 
ful, practical, responsible, self- 
denying, sincere, slow, stable, thor- 
ough, and thoughtful 

4. Concerned with making à good 
impression, cooperative, helpful, in- 
formal, outgoing, sociable, and 
tolerant 


Apathetic, excitable, lacking in 
self-direction, leisurely, overly 
judgmental in attitude, passive, 
pessimistic about their occupational 
futures, quiet, submissive, unam- 
bitious, and unassuming 


Coarse, constricted in thought and 
action, conventional and stereo- 
typed in thinking, narrow, opinion- 
ated, shallow, and shrewd 


Awkward, cautious, confused, eas- 
ily disorganized under str or 
pressures to conform, ygoing, 
impulsive, insecure, lacking in 
self-discipline, stubborn, and unin- 
hibited 


Aggressive, aloof, apologetic, asser- 
tive, cool and distant in their rela- 
tionships with others, defensive, 
detached, disbelieving and distrust- 
ful in personal and social outlook, 
irritable, overemphasizing personal 
pleasure and self-gain, overly influ- 
enced by others' reactions and opin- 
ions, retiring, self-centered, self- 
defensive, suggestible, suspicious, 
too little concerned with the needs 
and wants of others, and wary 
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"Though these data are not entirely consistent with all of the findings 
with the R-S scale reported in this chapter, they present a relatively con- 
vincing picture of Tepressers as appearing better adjusted than sensitizers. 
This characterization is even more striking in Fig. 4, in which Byrne, Go 
lightly, and Sheffield’s (in press) subjects were divided roughly into thirds 
on the basis of their R-S scale scores (29 repressers, 29 sensitizers, and 33 
neutrals) and their mean standard scores on the CPI plotted. The 


70 
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30 

DO CS SY sp SAWBRE SO sc TO GI CM AC Al IE PY FX FE 
CPI Scales 

Fic. 4. California Psychological Inventory 


Я — 
2 Profiles of ге ressers, neutrals, anc 
sensitizers. [From Byrne, Golightly, and Sheffield Р 


(їп press).] 
repressers may be seen to fall at or аһоу‹ 
zation sample with the neutrals 
on most of the scales, 


e the mean of Gough's standardi- 
and sensitizers falling progressively lower 


7. Overcontrol 


Megargee (1964) has presented ey. 


- : idence Supporting the hypothesis 
that extremely assaultive criminals (e 


B. murderers, those who commit 


ar less aggressive i ir everyday 
; ; М а ve in their every 
behavior and in test responses than qo other criminals or normals. The 


picture is one of a chronically overcontrolled individual who suddenly 
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breaks forth with extreme violence, often to the surprise of those who 
know him. 

In subsequent work, Megargee and Mendelsohn (1964) developed 
and cross-validated an MMPI scale which discriminates extremely as- 
saultive offenders from other criminals and from normals. Since repres- 
sion and denial are among the mechanisms utilized by the overcontrolled 
individual, they hypothesized a negative relationship between this new 
scale and the R-S scale. With a sample of 14 extremely assaultive, 28 
moderately assaultive, and 44 nonviolent criminals plus 50 male under- 
graduates, the two scales were found to correlate —.41 (p < .01) as pre- 
dicted. Thus, extreme repressers are seen to fit into the pattern of chronic 
overcontrol which includes rigidity, conventionality, moralistic views, 
and (at least in the criminal group) a greater probability of psychosis 
(Megargee and Mendelsohn, 1964). 


8. Conclusions 

The proposition that the two extremes of the repression-sensitization 
dimension represent different but equally maladjusted ways of respond- 
ing to anxiety and conflict is not consistent with the majority of evidence 
now available. The only supportive data are the findings which indicate 
greater variance on the defensive measure in a neuropsychiatric than in a 
college population and the correlation with the Megargee and Mendel- 
sohn measure of overcontrol. All other findings strongly suggest a linear 
relationship between sensitizing defenses and maladjustment. Scores on 
the R-S scale are positively related to tendencies toward alcoholism, the 
number of deviant responses given on an adjective check list, anxiety, 
emotional instability, social withdrawal, and a number of MMPI and 
CPI scales which indicate various types of psychological maladjustment. 
Those on the repression end of the continuum were found to be higher 
on the hysteria dimension in one investigation but not in another. Re- 
pressers also receive higher scores on a measure of ego strength. 'The weak- 
ness of the majority of this evidence is that paper-and-pencil tests are 
utilized as indicators of maladjustment, and repressers by definition are 
those who respond to such instruments by denying that anything is wrong 
with them. Whether they are presenting an accurate picture of their 
emotional health will ultimately have to be determined by means other 
than asking them again on a different paper-and-pencil test. 


D. ADDITIONAL ANTECEDENTS, CORRELATES, AND CONSEQUENTS 


1. Tendency to Give Socially Desirable Responses 


In 1953, Allen Edwards published an article which was to prove the 
precursor of a body of fruitful and at times disturbing research dealing 
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with the effect of the social desirability of the content of test items. When 
he obtained social desirability scale values for 
items, the correlation between this у 
agreeing with the item content was .87. This substantial relationship has 
been confirmed by subsequent investigations. £ 

A related, but somewhat different concept, is Edwards’ (1957) social 
desirability hypothesis, which simply Proposes the existence of individual 
differences in the tendency to give socially desirable responses to ques 
tionnaire items. He has built д 39-item Social Desirability (8D) scale E 
measure this tendency, In a number of investigations, scores on the SD 
scale have been found to correlate highly with scores on many personality 
measures, including most of the MMP] scales (Edwards, 1959). 

It would seem reasonable to expect that the R.S scale would also 
show a relationship because it is made up of MMPI items. In addition, 
individuals who utilize repressive mechanisms should stress socially 
desirable characteristics while sensitizing individuals should emphasize 
socially undesirable characteristics, With a Broup of 35 undergraduates, 


Joy (1963b) found a correlation of — 9| between the Edwards SD scale 
and the R-S scale, 


Liberty, Lunneborg, апа Atkinson (in press) conducted a factor 
analytic investigation in which several Personality measures were given to 
150 male students. The MMPI Was scored for 62 scales including the 


R-S and F-I scales. Factor analysis of the intercorrelation matrix yielded 
10 factors. Most of the variance w 


factors: social desirability, 
tization measures primaril e first of these factors. Factor 1 
was labeled social desirability bec; 


a series of questionnaire 
: subjects 
alue and the proportion of subject 


7 Pure measure of that factor. 
i includi Re (im Press) factor-analyzed 30 scales 
from eight tests, includ ing the F-] Scale, With a group of 137 University of 
А б actors identified, the Е scale had its highest 
loading on the first or social desirabi 


4 lity factor, 
Another approach to social desir. 


ability has been that of Crowne and 


Marlowe (1960), who point out that the Edwards SD scale is characterized 
by items containing primarily Pathological implications In a normal 
population, a high score, then, m бы 


' 5 e ау mean that the Subject is attempting 
to give the most socially desirable Tesponses ог that he js actually free of 
s actu: 


pathological symptoms. To remedy this Problem, they built a scale con- 
sisting of items dealing with culturally Sanctioned beh 
likely to occur. The resulting Crowne 
correlate .35 (p < .01) with the Edwards gy 

In both of the factor analytic Studies 


aviors that аге not 
arlowe Sp scale was found to 
2 scale, 


Indicated above (Liberty et als 
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REPRESSIO: 


in press; Wiggins, in press), the Crowne-Marlowe SD scale loaded high in 
à different factor from the Edwards SD scale and from the repression- 
sensitization measures. In unpublished research at the University of 
Texas, however, a significant negative correlation (r = —.37, p < .01) be- 
tween the Crowne-Marlowe SD scale and the R-S scale was found in a 


group of 115 students. 


2. Authoritarianism and Dogmatism 


In characterizing the personality dynamics of authoritarians, Adorno, 
Frenkel-Brunswik, Levinson, and Sanford (1950) stressed the mechanism 
of repression as one of the basic components. A correlation of —40 
(р < .01) between the R-S and F scales has been reported (Byrne, 1961). 
The authoritarian measure was a 10-item scale composed of five of the 
original F-scale items and the five reversed items of Couch and Keniston 
(1960). Subsequent research at Texas (Byrne and Bounds, 1964) has 
found that there is no relationship between response to the Couch and 
Keniston items and scores on the original F scale. Therefore, the meaning 
of the reported correlation is ambiguous at best. 

An unpublished study by Friedman (1963) used a 32-item F scale 
(27 original items and 5 successfully reversed ones). This scale correlated 
.28 with the R-S scale; with 33 subjects this coefficient was not significant. 
In another unpublished study, Byrne and Goldberg (1962) used the 
original F scale and found a nonsignificant correlation of .13 between 
authoritarianism and repression-sensitization in a group of 76 students. 

Rokeach (1960) in his work on open and closed mindedness has 
developed a Dogmatism scale designed to tap rigidities of both the 
political left and right. In addition, his Opinionation scales directly tap 
extreme political attitudes. In the Byrne and Goldberg investigation, in 
addition to the F scale and the R-S scale, Rokeach's Dogmatism, Left 
Opinionation, and Right Opinionation scales were administered to the 
76 subjects. Significant positive correlations were found between the R-S 
scale and dogmatism (r = 36, p « .01) and left opinionation (r = .33, 
P < .01), while repression-sensitization was not related to right opiniona- 
tion (r — .05). Joy (1963b) reported a correlation of .55 (p < .01) between 
R-S scale scores and scores on the MMPI Prejudice Scale. 

In unpublished research at York University, Sermat (1962) sought a 
relationship between repression-sensitization and belief in the likelihood 
of atomic war. Scores on this item correlated .38 (p < .05) with the R-S 
scale for 30 male students but only .02 for 30 female students. 


3. Other Personality Variables 
In the Weinberg (1963) factor analytic study cited earlier, 62 out- 
patients were given a series of measures including the R-S scale, the 
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Factors 
Myers-Briggs Type Indicator, and Guilford's Inventory of Facto! 
STDCR. As shown in Table VII, seven о 
these tests were significantly related to 
basis of these relationships, sensitizers w 
more introverted 


7 


f the nine variables measured 
Tepression-sensitization. On hoe 
ould tentatively be RM 
» Neurotic, emotionally maladjusted, and as respond 


TABLE VII 


ч T ичун "Trsrsa 
CORRELATIONS Or THE R-S ScaLE wiry THE MYeERs-Briccs AND GUILFORD 


Myers-Briggs Type Indicator 
Extraversion 


—42b 
—21 
Sensation 16 
Thinking 98с 
Judgment ^ ` 
Guilford Inventory of Factors STDCR 58» 
Social Introversion; Seclusiveness 44b 
Thinking Introversion; Reflectiveness gle 
Depression; Unhappiness; Pessimism 770 
Cycloid Disposition; Emotiona] Instability З 
Rhathymia; Carefreeness; c 


; Happy-Go-Lucky Disposition 


a After Weinberg (1963), 
5 bel. 
oP < 05. 


on the basis of perception rat 
hand, appear to be extray 
and to utilize judgment in 


К ther 
her than judgment, Repressers, on the niin 

Й 
Erted and wel] adjusted, to have self-con 
preference to Perception, 


1. Intellectual Ability 
In delineating the me, 
relationships may be 
Four investigations 


aning of test v 
аз Meaningful as the 
have included 
and an intellectual measure, Nonsignificant Corre] 
between the R.S scale and the Shipley 


graduates (r = —.15) (Byrne, 1961) h 35 hospital patients (r = = 
(Lomont, 1964), standard scores on a college entrance test with 26 ma | 
undergraduates (r = -25) (Byrne, 1961), anq college ability as measure 
by the SCAT with 30 male Students (у = 2б) and 30 female students 
(т = .03) (Sermat, 1962), and With 49 male students (ғ = .24) (Endler, 
1963). In addition, Endler (1963) failed to find a relationship between 
R-$ scale scores and High School Grade Point Average (r = 09) or College 
Final Grade Average C= —.02) with 49 first-year men at York University. 

The only suggestion of a relationship here is that of a small positive 


ation and intelligence for males. 


В rtain 
ariables, the absence of cert 


= iw 3 m riable 

the T€pression-sensitization vari d 
: m 
ations have been fot 


a а ler- 
ardord Scale with 132 und 
and wit 
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Following up this possibility, the author divided the 132 students in the 
Shipley-Hartford study into subgroups of 60 males and 72 females and 
computed the correlations separately. The 178 of —.18 and —.11 were not 
significantly different from zero. On the basis of existing data, then, the 
R-S scale and measures of intellectual ability appear to be independent. 


5. Parental Antecedents 

In an experiment reported in 1957, Dulany successfully 
patterns of repression and sensitization to subjects. With geometrical 
figures as the stimuli, perceptual repression was induced by punishing 
subjects' responses to a threatening stimulus, thus allowing competing 
responses to be reinforced via anxiety reduction. Perceptual sensitization 
was induced by punishing competing responses while allowing perceptual 
responses to the threatening stimulus to occur and hence receive reinforce- 


taught" 


ment. 

Generalizing from these findings, Byrne (in press) hypothesized that 
parental influences on the development of repressive and sensitizing 
defenses should be similar to Dulany's laboratory procedures. That is, in 
childhood repressers experienced punishment in their attempts to express 
conflictful impulses while sensitizers were permitted to express such 
material, at least at a verbal level. Thus, child-rearing attitudes falling 
along a permissive-restrictive dimension should be related to the develop- 
ment of sensitizing versus repressive mechanisms in the offspring. In three 
investigations, R-S scale scores were correlated with responses of mothers 
on the Parent Attitude Research Instrument (Schaefer and Bell, 1958), 
Subjects’ responses on the Hereford Parent Attitude Survey (Hereford, 
1963), and subjects’ perceptions of their mother’s responses to the Stan- 
ford Parent Attitude Questionnaire (Winder and Rau, 1962). The 
findings were that a relatively small but consistent group of the parental 
variables were related to repression-sensitization, as summarized in 
Table VIII. Though the findings must be cross-validated, the repressers 
indicate a home atmosphere characterized by permissiveness, acceptance, 
and confidence. Their mothers were consistent and high in self-esteem 
while the two parents had a positive affective relationship with one 
another. The sensitizers, оп the other hand, report a restrictive and 
rejecting home, and a lack of confidence in assuming the role of parent. 
Their mothers were inconsistent, low in self-esteem, and had a negative 
affective relationship with the father. It is obvious that the findings are 
quite different from those which were hypothesized. 

Another investigation of repressersensitizer differences in their 
descriptions of their parents was carried out by McDonald (in press). 
The subjects were 177 unmarried pregnant women, ranging in age from 
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14 to 39. Most of them were from lower middle class homes. On the R-S 
scale, the group's mean and standard deviation were comparable to those 
of female college students (Byrne, 1961). McDonald also identified sub- 
jects as expressors on the basis of MMPI scores: 2 Pd-(Welsh repression 


+ Hy denial). Subjects were cl 


TABLE VIII 
REPRESSION-SENSITIZATION AND CHILD-REARING ATTITUDES 


assified into four extreme groups consist- 


Repressers Sensitizers 
Males 
l. Mother permissive about sexual 1. Mother suppressive about sexual 
behavior behavior 
2. Mother perceived as having high 2. Mother perceived as having low 
self-esteem self-esteem 
3. Mother perceived as having high 3. Mother perceived as not having 
expectations of son achieving high expectations of son achiev- 
masculine role ing masculine role 
Females 
l. Mother strict 1. Mother permissive 
2. Mother does not expect deification 2. Mother expects deification of 
of parents parents 
3. Feels confident in parental role 3. Does not feel confident in paren- 
tal role 
4. Mother perceived as accepting 4. Mother peceived as rejecting 
5. Mother perceived as nonpunitive 5. Mother perceived as punitive and 
and not using physical punish- as using physical punishment 
ment 
6. Mother perceived as having high 6. Mother perceived as having low 
self-esteem self-esteem 
7. Father-Mother relationship per- 7. Father-Mother relationship per- 
ceived as positive ceived as negative 
8. Mother perceived as consistent 


ing of 34 repressers, 


fewer of the hostilit 
groups. With respect to r 
cantly less hostility to their fathers th 
the mother ratings, 
diflerent, but repressers did 
mothers th 


14 sensitizers, 23 expressors 
lizers. Each subject fi 
mother, and father. The m 
hostility items. ; 


attribute significantly 
an either the expressors or the expressor. 


d as inconsistent 


, and 34 €xpressor-sensi- 


lled ош the Interpersonal Check List for self, 

ajor emphasis of the 
would be expected, represse 
y items in the self- 


atings of p. 


investigation was on the 
rs indicated significantly 


ratings than each of the other three 
arents, repressers attributed signifi- 
an did any 
Tepressers and sensitizers 


of the other groups. On 
Were not significantly 
less hostility to their 
“sensitizers. When the 
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data were analyzed on the basis of self-parent discrepancies, the repressers 
consistently indicated the least discrepancy (especially with their mothers) 
while the expressor-sensitizers indicated the greatest discrepancy. 


6. Interpersonal Behavior 

Any pervasive personality variable, such as repression-sensitization, 
is potentially an important determiner of some aspects of interpersonal 
behavior. An individual's socially relevant motives, his perception of 
others, his response to the demands of group situations, and his effect 
on others are likely to be in part a function of his characteristic defense 
modes. 

Altrocchi (1961) asked each member of a group of student nurses 
(46 repressers and 46 sensitizers selected from a much larger group) to 
describe herself and three randomly selected classmates in terms of the 
items оп Leary's Interpersonal Check List. He found that the assumed 
dissimilarity between self and others was significantly higher for sensi- 
tizers than for repressers, regardless of the degree of actual similarity- 
similarity. Further analysis of the data indicated that the findings 
could be explained on the basis of the familiar represser-sensitizer 
dillerences in sell concept rather than differences in the perception of 
others. 

Altrocchi's Expressor Index [2 Pd-(Welsh R 4- Hy denial)] has proven 
to be a useful means of differentiating two types of sensitizing individuals. 
Altrocchi and Perlitsh (1963) hypothesized differences in attribution of 
hostility to self and others by sensitizers, repressers, expressors, and 
zers. The 101 subjects were selected from a group of 296 
senior nursing students at Duke University. Attribution of hostility was 
measured by the Love scores of Leary's Interpersonal Check List. It was 
found that expressor-sensitizers attributed more hostility to themselves 
than did repressers ог expressors, expressors attributed more hostility to 
others than did repressers, and more hostility was attributed by others to 
ехргеѕѕотѕ than to sensitizers. Altrocchi and Perlitsh (1963, p. 817) suggest, 

"As in previous studies, the present results demonstrate no difference 
and sensitizers in attribution of hostility to others, 
g that neither of these two groups is seen by 
Others as particularly hostile. Repressors do not seem to think that 
feelings of anger or hostile behavior are salient aspects of their function- 
ing... , and their associates in this study seem to agree; clinicians, 
however, do not. It remains to be demonstrated experimentally whether 
clinicians are correct in believing that hostile impulses are aroused as 
readily in repressors аз in other people, but are simply repressed." 

A subsequent investigation (Altrocchi et al., 1964) suggests that there 


expressor-sensi 


between repressors 
but add the expected findin| 
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are sex differences in the attribution of hostility to others in addition to 
differences based on defenses. 

In correlating the R-S scale with various MMPI scales, Joy (1963b) 
found repression-sensitization to correlate —.87 with leadership. Partly 
on the basis of this finding, he instituted an investigation (Joy, 19632) 
with 30 three-man groups, each composed of one represser, one sensitizer, 
and one neutral (a subject scoring in the middle quarter of the scale). 
The groups discussed a standard hum С 
each subject rated himself and the other two group members on a series 
of variables. Half of the groups were given a task orientation and half 
an ego threat orientation. The only significant finding based on ratings 
by others was that sensitizers were chosen less often as desirable work 
partners than either repressers or neutrals, He also found that repressers 


rated themselves as showing more concern for maintaining friendly rela- 
tions than did the sensitizers; n 


an relations problem, and then 


eutrals rated themselves as showing more 
leadership ability than did the two extreme groups. 


Turk (1963) investigated several as 
teams composed of a student nurse and 


were classified as sensitizers or repressers, but Turk preferred to label the 
former as having low presentational conformity and the latter high presen- 
tational conformity. The physicians were not measured on this variable 
but were assumed to vary randomly in assignment to the two types of 
nurses. Each team provided three weeks of professional health care in the 
outpatient clinic of a university hospital. After the work period, each 
subject completed a questionnaire, including questions concerning their 
enjoyment in working on the task and the amount of enjoyment 
attributed by teammates to each other. It was found that the correlation 
between the enjoyment which the physician attributed to the nurse and 
that which she herself expressed was signific 
sensitizing nurses: nurse's self-rating 
correlated .69 (p < -001) in the sensitize 
groups. While the physicians were accu 


pects of dyadic relationships in 
a student physician. The nurses 


assumed similarity in enjoyment was signific: 
repressing nurses. Physician’s self-ra 
correlated .80 (p < .001) w. and her rating of the 
physician correlated .62 (p < .01) in the repressing Broups. In the sensi- 
tizing groups, these two correlations were .35 and .30. One interpretation 

accuracy of interpersonal perception is most 


perceptual target is a sensitizer who exhibits 
consistency in expressions of affect in overt behavior and in self-ratings. 


With repressers, the overt communication of feelin and the self-ratings 
g 8: 


likely to occur when the 
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were apparently inconsistent, thus making possible distortion via assumed 
similarity. 

Silber and Baxter (1963) have supplied preliminary data from an 
investigation in which repressers and sensitizers were compared on several 
aspects of a verbal conditioning task. It seems possible to interpret their 
findings as reflecting differential responses to an interpersonal situation. 
They attempted to condition 52 undergraduates at the University of 
Kentucky to respond with either thought or motor-action verbs. On the 
basis of R-S scale scores, each subject was classified as a sensitizer or a 
represser. On index cards, the pronoun "he" appeared at the top with 
three alternative words (thinking, motor-action, and verbal activity) at 
the bottom. The subjects were instructed to make a two-word sentence, 
using the pronoun and one of the verbs. There was an operant level 
phase of 90 trials, an acquisition phase of 100 trials (^Mmm-hmm" for a 
thought verb for half the group and for a motor-action verb for the other 
half), and an extinction phase of 40 trials. A postconditioning interview 
Was conducted to determine the subjects’ awareness of the experimental 
conditions and their estimate of the frequncy of verbal reinforcements by 
the experimenter. Among the results relevant to repression-sensitization, 
Silber and Baxter found no differences between the groups in operant 
levels for the verbs, a significant interaction between repression-sensitiza- 
tion and awareness (aware repressers gave the greatest number of re- 
Sponses and aware sensitizers the smallest number) during acquisition, and 
greater extinction effects for repressers than sensitizers. Finally, compared 
With sensitizers, repressers overestimated the number of reinforcements 
emitted by the experimenter. It might be concluded that in this type of 
Social situation repressers differ from sensitizers in that they respond more 
to the implicit demands of the other person when they realize what those 
demands are. They more readily respond with a given class of words 
when the experimenter offers approval for this and more readily cease 
hen the experimenter withdraws his approval. In 
he situation such that they remember receiv- 
han was actually the case. 


this type of response w! 
addition, they misperceive t 


ing a greater number of social reinforcements tl 7 
Sensitizers, in contrast, appear to reject or resist the demands of the 


Situation, If this interpretation of the findings is an accurate one, 
repressers and sensitizers should not differ in response to an operant 
conditioning task which does not involve an interpersonal context. 
Shrader (1963) at the University of Massachusetts selected a group m 
72 male subjects from a pool of 194 who had taken a self-ideal dis- 
aire and the R-S scale. The experimental task was a 


crepancy questionn 
A apparatus in which subjects made 


modified Ames Thereness-Thatness 
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size and distance judgments of three full-face, life-sized photographs of 
college males. Each was described as a superior achiever in intellectual, 
social, or physical activities. Shrader's findings with respect to the 
defense groups were that repressers perceived the stimuli as larger and 
closer than the sensitizers and that they were more variable in their 
responses. The difference between тергеѕѕегѕ and sensitizers decreased as 
the stimulus distance increased. Shrader interpreted these results as 
meaning that the repressing individuals were more suggestible in re- 
sponding to the experimenter's instructions and that they were more 
labile in responding. The differences decreas 


ed when the stimulus was 
farther away because of the reduced threat qual 


ities at the greater distance. 
7. Obstetric Complications 


The effects of emotional stress and of personality differences in 
dealing with stress have been studied in connection with various 
obstetrical complications. Several research projects are under way to 
determine the part played by psychological variables in abortion, fetal 
mortality, labor time, and mental retardation. Recent findings suggest 
that differences along the repression-sensitization dimension are related 
to the probability of occurrence of some of these abnormalities, 

McDonald, Gynther, and Christakos (1963) obtained data on 86 
patients from the Obstetrical Clinic at the Medical College of South 
Carolina. Each patient was given the IPAT Anxiety Scale, the Kent EGY, 
and the MMPI, which was scored for repression-sensitization, On the 
basis of pregnancy, delivery-room, and post 
were classified as normal (N = 42 
consisted of disorders of 
malities observed at birth, and delivery irregul 
patients did not differ significantly 
Kent EGY Scores, or age. They wi 


partum records, the patients 
) or abnormal (N — 44). The latter 
the gestation period, developmental abnor- 
arities. The two groups of 
in infant birth weight, labor time, 

ere, however, different at beyond the 
01 level of significance in anxiety and repression-sensitization. The 
abnormal birth Broup was more anxious and more sensitizing. In the 
normal group, R-S scale scores were comparable to those of university 
coeds (Byrne, 1961) while patients in the abnormal group had a mean 
score higher than 85% of the women in the university sample. 

А One of the questions which arises from that finding is whether the 
high sensitization scores represent the characteristic functioning of these 
women in the abnormal group or whether they represent a special condi- 
tion induced in part by the stress of Pregnancy. McDonald and Parham 
(in press) investigated 160 unmarried women before and after the birth of 
their first child. The MMPI and Kent EGY Were administered at the begin- 
ing of the seventh month of gestation; shortly after delivery (7-10 days) the 
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MMPI was readministered. As in the previous investigation, patients were 
divided into groups of 79 with normal and 81 with abnormal births. 
Comparison of test scores before and after delivery revealed that pa- 
tients had significantly lower R-S scale scores (about 8 points) after the 
the baby was delivered. Comparison of the normal and abnormal groups 
indicated that they did not differ in IQ or age. Both groups were sig- 
nificantly less sensitizing after delivery, but at both periods the abnormal 
group was significantly more sensitizing than the normal group. They 
differed by 5.50 points before delivery and by 1.65 points after delivery. 
As the authors indicate, the reasons for the differences in ego defenses in 
the normal and abnormal groups and the shift toward more repressive- 


ness after delivery are not clear. 


8. Conclusions 


To date, a number of relationships have been established between 


repression-sensitization and other personality variables. That the Social 


Desirability variable is negatively related to the repression-sensitization 
dimension is well established, but the meaning of this relationship is 
erpretation. The R-S scale appears to be un- 
related to authoritarianism, right opinionation, or intelligence, but 
positively related to dogmatism, left opinionation, and prejudice. A 
or the latter three unexpected findings has not 


theoretical explanation fc 
been offered. There are, on the contrary, quite reasonable grounds for 
expecting that individuals who utilize repressive defenses would be 


prejudiced, dogmatic, and opinionated. To find that it is those with 
sensitizing defenses who tend to be high on those three dimensions will 
require reevaluation of existing formulations. The substantial relation- 
ships between repression-sensitization and various measures of introver- 
aise the possibility of interconnections between the 
and that of Eysenck. 


open to more than one int 


slon-extraversion т 


type of work described in this chapter T^ 
The attempts to extend knowledge about the repression-sensitization 


ating its antecedents in child-rearing attitudes and 
its consequents in interpersonal behavior and somatic functioning терге- 
Sent a potentially important type of research with personality variables. 

The area of personality shows increasing maturity as the antecedents 
and consequents of individual differences become known. With respect 
to repression-sensitization, it may be fair to state that nothing is yet 
known about its antecedents. The reported investigations can be inter- 
preted most parsimoniously as relationships between the R-S scale and 
particular types of self-report measures. The fact that repressers and 
Sensitizers report different things about their parents is informative, but it 


quite probably provides no evidence 


dimension by investig: 


about actual parental differences in 
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attitudes or behavior. Some of the findings concerning consequents in 
interpersonal behavior are subject to the same comments. However, the 
results indicating that sensitzers were not chosen as work partners, that 
perception by outsiders of the feelings of sensitizers was more accurate 
and less distorted than the perception of the feelings of repressers, that 
repressers and sensitizers differ in their responses in an operant condi- 
tioning task, and that they differ in judgments of size and distance of 
photographs, give promise of fruitful research possibilities in this area. 
Similarly, when sensitizing women are found to have greater difficulty 
in giving birth to their infants, it is clear that more is involved in re- 
presser-sensitizer differences than simply verbal response sets on paper- 
and-pencil tests. A variable is “good” to the extent that it is related to 


other variables; repression-sensitization appears to qualify for that desig- 
nation. 


ш. Repression-Sensitization: What Are the Next Research Steps? 


The research findings with the 


repression-sensitization dimension 
lead to two major questions 


about subsequent developments in this area. 
First, is the present conceptualization of this variable as involving indi- 
vidual differences in approach and avoidance defenses a reasonable one 
ог does some other concept better fit the dat 


a? Second, what are the next 
steps in theory and research to be taken w 


ith this variable? 


A. Мнат Is BEING MEASURED BY THE R-S ScALE? 


It seems prudent to view any investigation which uses a personality 
test as constituting, at least in part, a construct validity study. In the tra- 
ditional psychometric sense, there is literally no Way of validating or 
invalidating a personality test, Instead, the instrument may be evaluated 
with respect to its measurement consistency or reliability, its relationships 
with other personality variables may be determined, and its antecedents 
and consequents may be specified, It may then be di: 
able measure or shelved by researchers be 
The nearest thing to invalidation that c 
of the test. As the author (Byrne, 


arded as an unreli- 
cause it lacks relational fertility. 
àn occur is a change in the name 
1964, p. 51) has suggested elsewhere: 

"Another Way of asking the validational question is, "What is the 
best name to apply to the responses measured by this instrument?’ As 
simple (or simple-minded) as this question appears, much of the con- 
troversy centering around many tests resolyes itself to the naming prob- 
lem. Hopefully, the name designated for any measured behavior may be 
thought of as tentative. Perhaps the following qualifying statement should 
be assumed wherever the name of a test appears: ‘in view of the item 
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content of this test and its relationships with other variables, at this time 
the instrument appears to be conceptualized as a measure of ur 

If one is able to tolerate a certain amount of confusion and am- 
biguity, the characteristics of the variable being measured may be made 
more clear after a sufficient period of research activity. In the research 
reviewed in the previous section, for example, the dimension which we 
have discussed as repression-sensitization has been designated by others 
as facilitation-inhibition, affector-effector, and presentational conformity. 
In addition, evidence has been presented which suggests that the labels 
self concept, anxiety, neuroticism, or social desirability would be reason- 
able alternates. To assert that the R-S scale is only measuring response 
set or really measuring anxiety, as if these new labels somehow brought 
about conceptual clarity or as if existing data were no longer meaning- 


ful, is a sort of word magic. 
Do the labels or the test names matter then? Would it not be wise to 


use nonsense syllables or zip codes as the names of personality variables 
and thus side-step the problem? This approach seems unwise because the 
name of the variable and the surplus meaning attached to that name 
are likely to have a great influence on the type of research that is con- 
ducted and on the theoretical structure into which the findings are placed. 
Differences in labels and theory mean that data may be sought which 
would otherwise seem irrelevant. Farber (1964, p. 36), for example, points 
out that the fruitfulness of the MAS is its supposed relevance as a measure 
of drive in Hull-Spence theory. Even though the MAS and Edwards Social 
Desirability Scale correlate —.84, he suggests that it is doubtful whether 
investigators using the latter variable would have conducted studies "of 
€yelid conditioning, or psychophysical functions, or the steepness of 
stimulus generalization gradients, or paired-associate learning. : 

As with the R-S scale, users of the MAS point to the very high rela- 
ong the MAS, the Pt and К scales of the MMPI, 


tionships reported am 
| Scale of Edwards. Farber (1964, p. 30) asks, 


and the Social Desirability 
“What, then, does the test measure? 
“In one sense, this is a trivial question 
answer, though, unfortunately, one that constructors and users of tests 


sometimes fail to see, namely, that giving tests different names does not 
guarantee that they reflect different characteristics, and giving them the 
same names does not necessarily mean they reflect the same determinants. 
Tf all these highly interrelated measures are related to all other kinds of 
behaviors in the same way, they measure the same thing, regardless of 


their labels. 
“Occasionally, however, 
inferred from behavior is not merely nom 


, and requires but a trivial 


this confusion among the characteristics 
inal. For instance, height and 
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weight are highly correlated in the general population, yet no one sup- 
poses they are merely different names for the same thing. What if defen- 
siveness and desire to make a favorable impression are independent, but 
nevertheless empirically related in a given population? How, then, could 
we decide whether they reflect different organismic states or processes? 
Or better, if the one measure is related to some other mode of behavior, 
which hypothetical variable is responsible for the relation? | 

"These are not trivial questions, and their answers are not easily 
come by. One kind of answer is simply the observ; 


ation that there is never 
any guarantee in science against the inaccurate identification of deter- 
minants.” 


Farber (1964, р. 31) goes on to suggest: 


"It is usually possible, given skill and patience, to tease out ever 
finer specifications of the v; 


ariables entering into behavioral laws. If 
we consider response-defined variables, by careful selection of cases, 


statistical correction, or the elimination of errors of measurement, we can 
frequently decide which of several variables, even though they be highly 
correlated, determine the form of the relation under consideration. If it 
turns out, as it frequently does, that a given behavior is a joint function 
of several response variables, the circumstance is one quite familiar to 
experimentalists, For instance, no one is unduly disturbed by the finding 
that the probability of drinking is a function of both dryness of the 


mucous membranes and general water deficit, The additional fact that 


these two variables are themselves frequently related, under many condi- 
tions, is a complication, to be sure, but not an insuperable one. 

“All we can reasonably ask in regard to either experimental or dif- 
ferential variables is that they be specified as precisely as possible. We 
should then be prepared to discover, soon enough, 
detailed specifications аге necessary," 

With the repression-sensitiz 
to be rather clear-cut that the c 


that more precise and 


ation measures, the evidence would appear 


lefense scales and the Edwards SD scale are 
essentially interchangeable, two different names for the same variable. 


Even though this interpretation seems obvious, Ullmann and McReynolds 
(1963) have presented data which indicate otherwise, As reported earlier, 
they found а — 50 correlation between the F-I scale and ward ratings of 
anxiety in a group of neuropsychiatric patients. This relationship was 
also computed with Scores on the Edwards 5D scale held constant by 
means of partial correlation, The resulting coefficient was still а signi- 
ficant one (r = —38, p <.01), which Suggests clearly that something is 
being measured by the defense test in addition to social desi ability. 
Given such a finding plus the many studies which contribute to the 
construct validity of the instrument, at the present time the label 
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"repression-sensitization" does not appear inconsistent with what is 
known about the R-S scale. There is as yet no compelling reason to re- 
name the test. 


B. CONSIDERATIONS IN BUILDING A THEORY ОЕ REPRESSION- 
SENSITIZATION 

The history of the field of personality may be seen as containing the 
discarded remains of a vast number of variables investigated over the 
years—each measured more or less adequately, each the object of a flurry 
of research interest, and each passing out of favor. Except at the methodo- 
logical level, it is difficult for even the most hardened optimist to view 
all of this activity as representing the forward motion of behavioral 
science with each generation more enlightened than the previous one. If 
this characterization can be accepted, it may be helpful to consider the 
nature of research and theory building at a more general level than is 
represented by the specific problem of repression-sensitization. What is 
involved in pursuing an area of research such as that described in the 
present chapter and what would it mean to proceed with the building of 


a theory of repression-sensitization? 


1. General Considerations 

The goal of that branch of science which is psychology is the build- 
ing of theories which permit the accurate prediction of behavior. Beyond 
à general acceptance of "the scientific method," however, there are a 
diversity of positions taken by psychologists with respect to the strategy 
and tactics best suited to attaining that goal. One of the diflerences among 
psychologists is in their conception of the role of theory in research. This 
question includes the assumptions which underlie a theory and guide its 
Beneral framework (e.g. self theory with a phenomenological base versus 
S-R theory with a mechanistic determinism), the point at which theory 
assumes importance in research (eg. prior to the design of the first 
after a long period of data gathering), the type 
of theory which is constructed. (e.g., artistic description versus mathe- 
matical relationships among operationalized variables), and the breadth 
of the theory (e.g. а global framework which encompasses all of behavior 
versus relatively narrow theories tied to a specific series of behaviors). 


Each of these issues will be considered briefly. 


experiment versus only 


2. Pretheoretical Assumptions 

o mean quite different things to diflerent 
riptions of the kinds of assumptions held 
al, behavioristic, operational tradition 


Behavioral science seems t 
individuals. One of the best desc 
by psychologists in the experiment 
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has been presented by Farber (1964). The basic assumption is that be- 
havior is a function of its antecedents (i.e, determinism). Farber (1964, 
p- 32) says, . 
"What they come to, in a word, is an emphasis on the formulation 
of empirical laws and the analysis of the variables comprising them. As 
we have noted, the assumption that it is the business of behavioral science 
to explicate the relations between objectively defined environmental and 
behavioral events is in itself no theory, but rather a metatheoretical or 
pretheoretical preference. Those who adopt this approach, especially if 
they have an interest in the phenomena of learning, are likely to use the 
terms ‘S’ and ‘R’ to refer, respectively, to the environmental and be- 
havioral events, апа the familiar formula ‘S-R’ to indicate a relation 
between these two classes of variables." 
From this frame of referen 
able should be placed within 
possible to predict individual 
antecedents are known and t 


ce, then, the repression-sensitization vari- 
a framework of empirical laws, It will be 
differences along this dimension when its 
9 predict the effects of these differences on 
other behavior when its correlates and consequents are known, Our goal 
is not the Verstehen of the Romantic movement which would purport to 
provide an understanding of repression-sensitization, but rather the goal 
is to place this variable in a predictive framework as Holt (1962) has 
cogently argued. The end point will be the specification of lawful rela- 


tionships among variables rather than a colorful literary description of 
repressers and sensitizers, 


3. Time for Theory Building 


In a sense, it is virtu 


ally impossible to design 
and execute it without th 


any piece of research 
€ operation of at least 


an implicit theory. Vari- 
ables are seldom selected through the use of tables of random numbers. 
Beyond one's pretheoretical assumptions, there are expectations or 
hypotheses or hunches or general notions 


about relationships between 
X and Y. With the R-S Scale, the author's initial expectations may be 
summarized briefly: 


(1) Individuals may be placed 
their characteristic learned respons 
mechanisms define one end of this 
nisms the other, 

(2) A curvilinear relationship should be found between an indi- 


vidual's position on this continuum and various indices of his maladjust- 
ment. | 


along a continuum with respect to 
e to threatening stimuli; avoiding 


continuum and approaching mecha- 


АП of the research on repression-sensitiz 


ation which has been con- 
ducted by the author and much of the res 


earch conducted by others 
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follow from these two propositions. There is a great deal of support for 
the first and only modest support for the second. In addition to these find- 
ings, there are a number of other relationships which have been investi- 
gated and reported. Is it now time to tie all of these findings together 
into a more inclusive theory of repression-sensitization? It would certainly 
be the “socially acceptable” thing to do in terms of the mores of the field 
of personality. Perhaps, however, it is time to offer a dissenting opinion. 
It is instructive to compare work in personality with that of other scien- 
tists, including some of our psychological colleagues. Among the basic 
differences between the work of astronomers, biologists, physicists, chem- 
ists, and even learning theorists and the work of personality psychologists 
isa superficially simple one. In the other fields, the number of different 
variables is relatively small and there is relatively general agreement 
within the field as to what those variables are. Individual scientists may 
differ in what they do with the variables and certainly in how they inter- 
pret the findings, but there is a great deal of agreement otherwise. In per- 
sonality research, the mode is an unenviable creativity with respect to 
both stimulus and response variables. It is difficult to find as many 
as half a dozen studies in our journals which utilize the same independent 
or dependent variables. It would be a disheartening exercise to survey 
and list the number and variety of different defining operations for hos- 
tility, anxiety, threat, or any other variables widely studied in personality. 

At some point in the development of our field, this way of proceeding 
must change. Personality psychologists must find their equivalents of 
planetary motion, speed of a falling body, running speed in a straight 
alley runway, and frequency of the lever-pressing response. The next step 
will be the establishment of empirical laws which indicate the relation- 
ships between and among these variables. A law relating repression- 
sensitization to amount of anxiety aroused in response to threat, for 
example, will not arise from а series of investigations each of which in- 
volves the use of different operations to define anxiety and threat, each 
significant at least at the .05 level, and each conducted only a single time. 
When a series of such laws are in fact established, it will be the appro- 
priate time to devise theories in an attempt to organize these laws into 


More inclusive units. : " 
sible to formulate simple predictions and 


In the meantime, it is pos : 
ions from observation, deductions from 


hypothe eralizati 

NESES based on gen a : 

Other theories, and inductions from previous research. Any attempt to 
, 


get more elaborate in our preliminary theorizing would seem to реш 
unfruitful endeavor on the basis of past experience. Many possibilities 
for research have been raised by those working with repression-sensitlza- 
tion and interpersonal behavior, obstetric complications, and maladjust- 
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ment, for example. There would seem to be little advantage in jumping 
1 2 


from these interesting possibilities to a formal theory without pausing 
first to obtain much more extensive data. 


4. Type of Theory 


For many psychologists, the field of personality has been synonymous 
with broad descriptive formulations employing anecdote, allusion, meta- 
phor, and analogy while eschewing operational definitions and mathe- 
matical formulas. Obviously, the typical investigator in personality re- 
search does not match that characterization. It seems equally clear that 
the individual research efforts, the emerging laws, and the future theories 
of personality will bear considerably more resemblance to physics than 
to psychoanalysis. 

For most behavioral scientists 


‚ these points are either too absurd to 
consider seriously or too obvious 


to bear further elaboration. 


5. Breadth of Theories 


Again, existing personality “theories” 
breadth. Typically, an attempt is made to explain all of man’s behavior 
in terms of a given series of descriptive formulations, Moreover, the 
crowning achievements of other fields tend to be theories which are ex- 
tremely broad and hence able to unify a multitude of established findings: 


Geocentrism, evolution, and e = mc? are familiar examples. 


Perhaps it would be prudent for pers 
somewhat less ambitious theory building at this point in the development 
of knowledge about human behavior. The necessary combination of indi- 
vidual genius and a suitable body of data apparently has a fairly low 
base rate. Instances of the former without the latter have been charac- 
teristic of personality psychology, and Sigmund Freud is the outstanding 
example. Instances of the latter without the 


Sciences. One of the reasons for the rarity of the successful broadly en- 
compassing theory is that error is increasingly easy as one moves from the 
data on which the laws were built. As Rogers (1959, p. 191) puts it, 


ау make little difference in providing 
ed facts out of Which the theory grew. But 


d to explain more remote phenomena, the 
and the inference from the theory may be com- 
pletely false. |, Thus, every theory deserves the greatest respect in the 
area from which it was drawn from the facts and a decreasing degree 


of respect as it makes predictions in areas more and more remote from 
its origin.” 


are characterized by their 


onality psychologists to attempt 
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What, then, might our theoretical goals be? Elsewhere the author 
suggests (Byrne, in preparation), 

"It would seem that a potentially fruitful approach to theory build- 
ing in the science of personality is the construction of small theories 
dealing with specific and limited behavioral events rather than a general 
theory of personality. Personality theories should be limited in scope but 
predictively powerful. As tight and accurate theories are built in areas 
such as achievment, authoritarianism, and self-consistency, the finding 
of still more general relationships will tie them together into increasingly 
inclusive overall theories. This, however, is an end-point rather than a 
starting point. First, we need to devise psychological laws analogous to 
those of the pendulum, of gases, of the lever, of the effect of gravity on 
momentum, etc. As satisfying and useful as it would be to have a com- 
plete theory of behavior immediately available, a more limited and more 
attainable goal should be sought for the present." 
the comments in this concluding section will be viewed 
in work on repression-sensitization and 
not as a criticism of any of the work which has been reported. On the 
contrary, it seems that a great many investigators in a wide variety of 
locations are engaged in worthwhile and exciting research on this dimen- 
sion. It is clearly possible to continue these research efforts in such a way 
as to extend our knowledge about one aspect of personality. And, oddly 
enough, if we do our work well, it is inevitable that the knowledge will 


have relevance to all aspects of personality. 


Hopefully, 
as a blueprint for future progress 


ACKNOWLEDGMENTS 


A great many individuals were extremely helpful in providing suggestions, reprints, 
unpublished data, reports on research in progress, and doctoral dissertations. In addi- 
tion, a number of individuals were kind enough to offer suggestions and criticisms 
of the original manuscript. The contributions of the following investigators to this 
chapter are gratefully acknowledged: John Altrocchi, James С. Baxter, Barbara Blay- 
lock, Leslie Davison, Norman S. Endler, Tom Friedman, John T. Gossett, Austin 
Grigg, Malcolm D. Gynther, Judith E. Hanson, Vernon Joy, Richard darut Paul 
Liberty, James F. Lomont, Arthur W. Lucky, Robert L. McDonald, Louis J. Moran, 
Walter O'Connell, Vello Sermat, William K. Shrader, Vincent J. Tempone, Herman 
Turk, Leonard P. Ullmann, Norris H. Weinberg, and Jerry S. Wiggins. 


References 

Adorno, T. W., Frenkel-Brunswik, Else, Levinson, D. J., and Sanford, R. N. The 
authoritarian personality. New York: Harper, 1950. - 

Altrocchi J Interpersonal perceptions of repressers and sensitizers and component 
analysis of assumed dissimilarity scores. J. abnorm. soc. Psychol, 1961, 62, 528-534. 

Altrocchi J» and Perlitsch, Hilda D. Ego control patterns and attribution of hostility. 


Psychol. Rep., 1963, 12, 811-818. 


216 DONN BYRNE 


Altrocchi, J., Parsons, O. A., and Dickoff, Hilda. Changes in Seb ides] discrepancy in 
repressors and sensitizers. J. abnorm. soc. Psychol., 1960, 61, 67-72. Y: i 

Altrocchi, J., Shrauger, S., and McLeod, Mary A. Attribution of hostility to gerr and 
others by expressors, sensitizers, and repressors. J. clin. Psychol., 1964, 20, 239. . 

Berg, I. A. Deviant responses and deviant people: the formulation of the Deviation 
Hypothesis. J. counsel. Psychol., 1957, 4, 154-161. 

Berg, I. A. The unimportance of test item content. In В. М. Bass and 1. А. Berg (eds) 
Objective approaches to personality assessment, Princeton, New Jersey: Van Nos- 
strand, 1959. Pp. 83-99, 

Blaylock, Barbara A. H. Repression-sensitization 
dental recall. Unpublished master's thesis, 

Block, J., and Thomas, Н. Is satisfaction w 
soc. Psychol, 1955, 51, 254-259, 

Brown, J. S. The motivation of behavior. New York: McGraw-Hill, 1961, P 

Bruner, J. S., and Goodman, C. C. Value and need as organizing factors in perception. 
J. abnorm. soc. Psychol., 1947, 42, 33-44, 

Bruner, J. S., and Postman, L. Tension and tension 
perception. J. Pers., 1947, 15, 300-308. (a) 

Bruner, J. S., and Postman, L 
1947, 16, 69-77. (b) 

Byrne, D, The Repression-Sensitization 
J. Pers., 1961, 29, 334-349, 

Byrne, D. Assessing personality variables 
D. Byrne (Eds.), Personality change. 

Byrne, D. Child-rearing antecec 
press, 

Byrne, D. Personality: Am approach to the 
Cliffs, New Jersey: Prentice-Hall, in preparation. 

Byrne, D., and Bounds, C. The revers; 

Byrne, D., and Goldberg, June. Repre: 
and political attitudes. Unpublish 

Byrne, D., and Holcomb, Joan. T Y ОЁ a response Measure: Differential 
recognition-threshold Scores. Ps 


ychol. Bull., 1962, 59, 70.73. 
Byrne, D., and Sheffield, J- Response to sexu. 
repressing 


and sensitizing defenses, J. abnorm, Psychol., in press. 

Byrne, D., Terrill, J., and McReynolds, P, Incongruency as 
humor. J, abnorm. soc. Psychol., 1961, 62, 435-438. 

Byrne, D., Barry, J., and Nelson, D, Relation of the rev. 
Scale to measures of self-description, Psychol, Rep., 

Byrne, D., Golightly, Carole, and Sheffield, J. The Repression-Sensitization Scale 
measure of adjustment: Relationship with the CPI, J. consult. Psychol., in press, 

Carlson, V, R. Individual differences in the recall of Word-association-test words. 
J. Pers., 1954, 93, 77.87. 

Carpenter, B. Wiener, M., 
defense behavior. J. abn 1956, 52, 380-383. 


Chodorkoff, B, Self-perception, perceptual defense, and adjustment. J. abnorm. soc. 
Psychol, 1954, 49, 508-519. | 


Couch, А„ and Keniston, К. Yeasa 
personality variable, J. abnorm. soc, Psychol, 
Crowne, D. P., and Marlowe, D. А. A new 
Psychopathology. J, consult. Psychol., 


» Word association responses, and inci- 
Univer. of Texas, Austin, Texas, 1963. 
ith self a measure of adjustment? J. abnorm. 


release as organizing factors in 
. Emotional selectivity in perception and reaction. J. Pers., 


Scale: Rationale, reliability, and validity. 


and their alteration. In P. Worchel and 
New York: Wiley, 1964. Pp. 38-68. | 
lents of Tepression-sensitization, Child Develpm., in 


science of human behavior. Englewood 


ally arousing stimuli as a function of 
à predictor of response to 


ised Repression-Sensitization 
1963, 13, 323.334. 


and Carpenter, Janeth T, 


Predictability of perceptual 
orm. soc. Psychol., 


yers and naysayers: Agreeing response set as a 
„ 1960, 60, 151-174. 
Scale of social 


desirability independent of 
1960, 24, 349.354 


REPRESSION-SENSITIZATION 217 


Davison, L. A. Adaptation to a threatening stimulus. Unpublished doctoral dissertation, 
Univer. of California, Berkeley, California, 1963. 

Dulany, D. E., Jr. Avoidance learning of perceptual defense and vigilance. J. abnorm. 
soc. Psychol., 1957, 55, 333-338. 

Edwards, A. L. The relationship between the judged desirability of a trait and the 
probability that the trait will be endorsed. J. appl. Psychol., 1953, 37, 90-93. 
Edwards, A. L. The social desirability variable in personality assessment and research. 

New York: Dryden, 1957. 
Edwards, A. L. Social desirability and personality test construction. In B. M. Bass and 
Т. A. Berg (Eds.), Objective approaches to personality assessment. Princeton, New 


Jersey: Van Nostrand, 1959. Pp. 100-118. 

Endler, N. S. Personal communication, 1963. 

Eriksen, C. W. Perceptual defense as a function of unacceptable needs. Unpublished 
doctoral dissertation, Stanford Univer., Stanford, California, 1950. 

Eriksen, C. W. Some implications for TAT interpretation arising from need and per- 
ception experiments. J. Pers., 1951, 19, 282-288. 

Eriksen, C. W. Defense against ego-threat in memory and perception. J. abnorm. soc. 
Psychol., 1952, 47, 230-235. (a) 

Eriksen, C. W. Individual differences in defensive forgetting. J. exp. Psychol, 1952, 
44, 442-446. (b) 

Eriksen, C. W. Psychological defenses and “ego-strength” 
and incompleted tasks. J. abnorm. soc. Psychol., 1954, 49, 45-50. 

Eriksen, С, W., and Browne, С. T. An experimental and theoretical analysis of per- 
ceptual defense. J. abnorm. soc. Psychol., 1956, 52, 224-230. 

Eriksen, С. W., and Davids, A. The meaning and clinical validity of the Taylor 
anxiety scale and the hysteria-psychasthenia scales from the MMPI. J. abnorm. 


soc. Psychol., 1955, 50, 135-137. 

Eriksen, C. W., and Kuethe, J. L. Avoidance 
awareness: A paradigm of repression. J. а 

Eriksen, C. W., and Lazarus, К. S. Perceptual defense and projective 
soc. Psychol., 1952, 47, 302-308. 

Eriksen, С. W., Kuethe, J. 1, and Sullivan, D. F. Some personality correlates of learn- 
ing without awareness. J. Pers, 1958, 26, 216-228. 

Farber, 1, E, A framework for the study of personality as a behavioral science. In 
P. Worchel and D. Byrne (Eds), Personality change. New York: Wiley, 1964. Pp. 3- 


37. 
Fi tinger, L, A theory of cognitive dissonance. 
Friedman, s, T. The relationship between atti 
anism. Unpublished manuscript, 1963. 
Gordon, J. E. Interpersonal predictions of repressors 
686-698. 
Gordon, J, E, The stability of th 


sensitizers. J. Pers., 1959, 27, 362-373. | | 
Gossett, J. T. An experimental demonstration of Freudian repression proper. Unpub- 


lished doctoral dissertation, Univer. of Arkansas, Fayetteville, Arkansas, 1964. 
Gough, H. G. California Psychological Inventory manual, Palo Alto: Consulting 


in the recall of completed 


conditioning of verbal behavior without 
bnorm. soc. Psychol., 1956, 53, 203-209. 
tests. J. abnorm. 


Stanford: Stanford Univer. Press, 1957. 
tudes toward children and authoritari- 


and sensitizers. J. Pers., 1957, 25, 


c assumed similarity response set in repressors and 


Psychologists Press, 1957. ў | | 
Crigg, А, E., and Thorpe, J. 5. Deviant responses in college adjustment clients: А test 
Of Berg's Deviation Hypothesis. J. consult. Psychol., 1960, 24, 92-94. 
Gynther, M, D. Personal communication, 1963. 
Hanson, Judith E. Personal communication, 1963. 


218 DONN BYRNE 


Hathaway, S. R., and McKinley, J. C. Manual for the Minnesota Multiphasic Person- 
ality Inventory. (Rev. ed.) New York: Psychological Corporation, 1951. | | 
Hereford, С. F. Changing parental attitudes through group discussion. Austin: Univer. 

of Texas Press, 1963. 
Holt, R. R. Individuality and generalization in the psychology of personality. J. Pers., 
1962, 30, 377-404. | 
Joy, V. L. Repression-sensitization, personality, and interpersonal behavior. Unpub- 
lished doctoral dissertation, Univer. of Texas, Austin, Texas, 1963. (a) 
Joy, V. L. Repression-sensitization and interpersonal behavior. Paper read at Amer. 
Psychol. Ass, Philadelphia, August, 1963. (b) 
Kissin, B., Gottesfeld, H., and Dickes, R. Inhibition and t 
sexually charged words. J. Psychol., 1957, 43, 333-339. 
Kogan, N. Authoritarianism and repression. J. abnorm. soc. Psychol., 1956, 53, 34-37. 
Kurland, S. H. The lack of generality in defense mechanisms 
perception. J. abnorm. soc. Psychol., 1954, 49, 173-177. 
azarus, R. S., and Alfert, Elizabeth. The short circuiting of thre: 
altering cognitive appraisal. J. abnorm. soc. Psychol, in press. 
Lazarus, R. $, and Longo, N. The consistency of psychological defense against threat. 
J. abnorm. soc. Psychol., 1953, 48, 495-499. 
Lazarus, R. S., Eriksen, C. W., and Fonda, C. P. Personality dynamics and auditory 
perceptual recognition. J. Pers., 1951, 19, 471-482. 
Lazarus, R. S., Speisman, J. C., Mordkoff, A. M., and Davy. 
of psychological stress produced by 


achistoscopic thresholds for 


as indicated in auditory 


B at by experimentally 


ison, L. A. A laboratory study 
a motion picture film. Psychol. Monogr. in 


press. 
Liberty, Р. G., Jr. Lunneborg, C. E., and Atkinson, G. C. Perceptual defense, dis- 
simulation 


‚ and response styles. J. consult. Psychol. 

Lomont, J. F. Personal communication, 1963. 

Lomont, J. F. The repression-sensitization dimension in relation to 
Unpublished manuscript, 1964. 

Lucky, A. W., and Grigg, A. E. Repression-sensitization as ay 


responding. J. clin. Psychol., 1964, 20, 92-93. 
McDonald, R. L. 


, in press. 
anxiety responses. 


ariable in deviant 


Ego control patterns and attribu 
J. abnorm. soc. Psychol., in press. 
McDonald, R. 


tion of hostility to self and others. 


‚ and Parham, K. J. Rel 
to obstetric complic 
press. 


McDonald, R. L., Gyther, M. D., and Christakos, 


anxiety and obstetric complications. Psychosom. Med., 1963, 25, 357-363. 
MacKinnon, D. W., and Dukes, W. F. Repression, In L. Postman (Ed.), Psychology in 
the making. New York: Knopf, 1962, Pp. 662-744. 
McReynolds, P. A restricted conceptualization of human anxiety and motivation. 
Psychol. Rep., 1956, 2, 293-312. 
McReynolds, P. Anxiety as rel 
Psychol. Rec., 1958, 8, 57-66. 
McReynolds, P., and Ullmann, L. P. Differential recall o 
words as a function of anxiety. J. 
Mathews, Anne, and W: 


ation of emotional char 


ages during pregnancy 
ations in unmarried primigravid 


ас. Amer. J. Obst. Gynec, in 


A. C. Relations between maternal 


ated to incongruencies between values апа feelings. 


f pleasant and unpleasant 

clin, Psychol., 1964, 20, 79-80. 

ertheimer, M. A "pure" meas 

taminated by response suppression. J. abnorm, 

Megargee, E. I. Undercontrol and overcontrol 
Unpublished doctoral dissertation, Univer. 


ure of perceptual defense uncon- 
soc. Psychol., 1958, 57, 373-376. 

in assaultive and homicidal adolescents. 
of California, Berkeley, California, 1964. 


REPRESSION-SENSITIZATION 219 


Megargee, E. I., and Mendelsohn, С. A. The assessment of the chronically overcontrolled 


. assaultive person. Unpublished manuscript, 1964. 

Nelson, S. E. Psychosexual conflicts апа defenses in visual perception. J. abnorm. soc. 

| Psychol., 1955, 51, 497-133. 

Nowlis, V., апа Nowlis, Helen H. The description and analysis of mood. Ann. N.Y. 
Acad. Sci., 1956, 65, 345-355. 

O'Connell, W., and Peterson, Penny. Humor and repression. J. exist 
1964, 4, 309-316. 

Page, H. A., and Markowitz, 
bias. J. Psychol., 1955, 40, 431- 

Perloe, S. I. Inhibition as a determin 
1960, 11, 59-66. 

Pomeranz, D. M. The repression-sensitization dimension and reactions to stress. Unpub- 
lished doctoral dissertation, Univer. of Rochester, New York, 1963. 

Postman, L., Bruner, J. 5., and McGinnies, E. Personal values as selective factors in 
perception. J. abnorm. soc. Psychol., 1948, 43, 142-154. 

Rogers, C. R. A theory of therapy, personality, and interpersonal relationships as 
developed in the client-centered framework, In S. Koch (Ed.). Psychology: А study 
of a science, Vol. 11: Formulations of the person and the social context. New York: 
McGraw-Hill, 1959, Pp. 184-256. 

Rokeach, M. The open and closed mind. New Y: 

Schaefer, E, S., and Bell, R. Q. Development of a par 
ment, Child Develpm., 1958, 29, 339-361. 

Sermat, V, Personal communication, 1962. 

Shannon, D. T. The effects of ego-defensiv 
nition, Unpublished doctoral dissertation, 
1955. 

Shannon, D. T. Clinical patterns of defense as revealed in visui 
J. abnorm. soc. Psychol., 1962, 64, 370-377. 

Shrader, W. К. Size-distance perception of hu 

and defensive mode. Unpublis 


Massachusetts, 1963. 
mmunication, 1963. 


ential Psychiat., 


Gloria. The relationship of defensiveness to rating scale 


ant of perceptual defense. Percept. mot. Skills, 


ork: Basic Books, 1960. 
ental attitude research instru- 


e reactions on reported perceptual recog- 
Stanford Univer., Stanford, California, 


al recognition thresholds. 


man images as a function of self-ideal 

discrepancy hed doctoral dissertation, Univer. of 
Е Massachusetts, Amherst, 
Silber, D., and Baxter, J. C. Personal cor 
Solomon, R. L., and Postman, L. Frequency of usage as a determinant of recognition 

: 5 5-902 

thresholds for words. J. exp- Psychol, 1952, 43, 195-202. І 
Speisman, J. €. Lazarus, R. S., Mordkoff, A. M., and Davison, L. A. The experimental 

reduction of stress based on cgo defense theory. J. abnorm. soc. Psychol., in press. 
Tart, c. T Frequency of dream recall and some personality measures. J. consult. 


.. Psychol, 1962, 26, 467-470. 
l'empone, V. J. Differential thresholds of repressers and sensitizers as a function of a 
and failure experience. Unpublished doctoral dissertation, Univer. of Texas, 
. Austin, Texas, 1962. 
Tempone, V. J. Construc 
Paper read at Amer. Psychol. 
Truax, C. B. The repression response 
Psychasthenia index. J. abnorm. $ос. Psychol., 
Turk, н, Norms, persons, and sentiments. Sociometry, 
Ullmann, L. P. Clinical correlates of facilitation anc 
tional stimuli. J. proj. Tech, 1958, 22, 341-347. 
Ullmann, L. P. An empirically derived MMPI scale w 
inhibition of recognition of threatening stimuli. J. clin. Psychol., 


t validity studies of Byrne's Repression-Sensitization Scale. 


Ass., Philadelphia, August, 1963. 

to implied failure as a function of the hysteria- 
1957, 55, 188-193. 

1963, 26, 163-177. 

1 inhibition of response to emo- 


hich measures facilitation- 
1962, 18, 127-132. 


220 DONN BYRNE 


Ullmann, L. P., and Lim, D. T. Case history material as 
of patterns of response to emotional stimuli in a 
Psychol., 1962, 26, 221-295. 

Ullmann, L., P., and McReynolds, P. Di 
patients: Empirical findings and theoretical for; 
Psychol. Ass., Philadelphia, August, 1963. 

Ullmann, L. P., Weiss, В. L., and Krasner, L, The effect of verbal conditioning of 


emotional words on recognition of threatening stimuli. J. clin. Psychol., 1963, 19, 
182-183. 


Weinberg, N. H. Word association style, field 
variables. Unpublished manuscript, 1963. 
Wiggins, J. S. Convergences among stylist 

in press. 
Winder, C. L., and Rau, Lucy R. Parental attitudes associated w 
in preadolescent boys. J. abnorm, 
Worchel, Р. Adaptability screenin 
inventory for predicting mal 
56-62, 1957. 


à source of the identification 
study of humor. J. consult. 


dependence, and related personality 
ic response measures. Educ, Psychol. Measmt., 


ith social deviance 
с. Psychol., 1962, 64, 418-424. 

Б of flying personnel: Development of a self-concept 
adjustment. SAM, USAF, Randolph AFB, Texas, No. 


INTERPERSONAL PERCEPTION 


Victor B. Cline 


UNIVERSITY OF UTAH, SALT LAKE CITY, UTAH 


1. “Introduction: e ene 
П. Standardized Judging Tests .. 
II. Cronbach's Components and the End of Naiv 


e Empiricism 225 


A. Cronbach's Components .......s seer es 297 
B. The RS-AS-ACC Problem ............ ee eces 230 
234 


C. Other Sources of Error in Judging-Accuracy Scores .. 
D. Hatch’s Forced-Choice Differential Accuracy Approach 2 


E. Accuracy versus Process Approaches in Research .... 238 
IV. Analysis of Judgments from Motion-Picture Samples .... 940 
A. Construction of Films .......... е 241 
B. Preliminary Reliability and Validity 243 
C. The Generality of Judging Ability ... .. 245 
D. Relationship between the Scoring System and Com- 
ponents 252 
E. Effect of Varied Stimulus Information on Judging- 
Accuracy Scores vu BBS) 
F. Components of Judging-Accuracy Scores Applied to 
the Clinical versus Statistical Prediction Controversy 260 
G. Comparisons of Groups versus Individuals in Making 
Judgments about Others ...... 2A 266 
H. A Factor Analytic Study of the Interpersonal Accuracy 
CORRER „гәне чы Tigi CRN даша бна качи HAE 272 
I. Effect of Social Desirability Response Set on Judging 
Accuracy m 
V. Summary ...- 
References 


L Introduction 
Appraising, understanding, and judging others underlies nearly all 
human intercourse. Most kinds of mental and emotional illness involve 


fairly serious distortions or breakdowns in communication as well as in 
Interpreting and evaluating social and interactional cues. The evaluation 
and assessment of others by all men is such a consistent and on-going 


Process that it operates almost automatically at times, commonly at non- 
verbal and sometimes at almost unconscious levels. It can be a highly 
rational ог even formal process. As Asch has put it, "To take our place 
With others we must perceive each other's existence and reach a measure 
of comprehension of one another's needs, emotions, and thoughts." 
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In a great variety of formal occupations, positions of leadership, and 
marital, parental, and psychotherapeutic relationships, as well as in in- 
formal social units, effective functioning would appear to be critically 
related to and dependent upon our ability to perceive subtleties and 
nuances in the behavior of others. We must anticipate their reactions to 
different decisions and discern those inner states and convictions which, 
as Gough (1955) has put Th, 5... although determining choice and be- 
havior are often concealed and/or denied in overt expression." 

The judging process has been referred to by many names including 
empathy, insight, clinical intuition, identification, diagnostic competence, 
understanding, social sensitivity, social acuity, soci 
personal perception (or perceptiveness), person perception, and person 
cognition. АП have slightly varying meanings. For example, “person per- 
ception" implies that the judge or perceiver exists apart and outside the 
world of the person being judged. Such is the case when one uses still 
photographs of an "other" as the primary source of cues from which 
judgments are made. If, however, the judging situation involves a double 
interaction between judge and "other," as usually occurs in real life, it 
might be more accurately referred to as "interpersonal perception." 
Tagiuri and Petrullo (1958) have commented, “Through his own presence 
and behavior in the perceptual situation of the ‘other,’ the perceiver 
(judge) may alter the perceptual characteristics of the person whose state 
he is trying to judge. This is, of course, eminently different from the way 
in which a rock is a source of cues for a perceiver.” Despite differences 
such as these in the conditions and nature of the judging task, this general 
area of investigation continues to receive increasing attention from many 
investigators despite a plethora of methodological complexities. 


al perception, inter- 


П. Standardized Judging Tests 


When one surveys the literature and зис} 


1 references as “Buros 
Mental Measurements Ye 


ar Book" for signs of carefully validated pub- 
lished social sensitivity, empathy, or judging tests, the cupboard is bare. 
The only tests of note reported in Buros (1959) are Kerr and Speroff's 
Empathy Test (1951) and Kerr's Primary Empathic Abilities procedure 
(1957). These, however, tend to be somewhat misnamed in that they 
attempt to measure sensitivity to the “generalized other’! 


р by requiring 
such things as ranking the circulation of 15 magazines, the popularity 
1 "Sensitivity to the generalized other" is a term introduced Ъ 
et al. (1958) and is defined as an awareness E the Social no; 
response of a large class or group. Example: predicting the тези] 
poll, or a typical response of a special class of people—say, 
also sometimes called "stereotype accuracy. 


y Bronfenbrenner 
Tm Or of the typical 
ts of a public opinion 
college students, This is 
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of 15 types of music, and so forth. The evidence for validity for these 
tests is so slight as to eliminate them from serious consideration as genuine 
measures of empathy. 

Gough (1953) developed an Opinion Prediction Scale which seems 
to measure accuracy of knowledge or stereotype about people in general, 
ог, in other words, sensitivity to the "generalized other." Judges are 
required to predict the percentage of "true" responses to a number of 
MMPI-like items for “people in general” and for male prison inmates 
using a multiple choice response format. The options are 0-25%, 26-50%, 
51-75%, and 76-100%. While this is not a measure of judging ability for 
Specific individual persons (often referred to in the literature as "social 
objects” or "others"), Gough (1955) has found it a useful instrument in 
some studies of “social acuity.” 

In addition there are some procedures such as Chapin's Social Insight 
Test (1942) which have not been published but which have been used 
In occasional studies with the author's consent on an experimental basis. 
Knowledge of their existence has been circulated informally as well as 
through occasional references in the psychological literature. Frequently, 
as in the case of Chapin's test, the results have been equivocal. With 
Chapin's test several paragraphs of a true-to-life situation are presented 
and the judge is asked to choose the correct outcome from several alterna- 
tives. In some cases this involves predicting what an individual feels or 
Will do. Other situations require predicting what a group will say, or do. 
In one study (Gough, 1955) correlations of .47, .47, and .39 were found 
between Chapin's test and the Terman Concept Mastery Test of Intelli- 
Bence, the Crutchfield Battery of Insight Puzzles, and the Bennett 
Mechanical Comprehension Test. 

The sound films and judging instruments of Cline (19552) and Cline 
and Richards (1958, 1963) are further examples of judging procedures 
Which are still "experimental" and available for research purposes only. 
These include three entirely separate sets of films developed over a 
decade, with up to 12 separate judging tests per film and up to 30 films 
10 а set. 

From Flanagan's Aptitude Classification Test the "Judgment and 
Comprehension” subtest has some items that might be regarded as tapping 
Judging ability or social sensitivity though Flanagan makes no claims 
that this subtest measures other than the “ability to understand a situ- 
ation and determine the proper action to take.” The “How Supervise,” 
(Remmers and File, 1948) sometimes used as an executive or foreman 
Selection test, could conceivably be regarded as an interpersonal percep- 
tion test. However, its validity data limits it to being a “crude device for 
Screening first level supervisory personnel of low educational level.” 


224 VICTOR B. CLINE 


The Moss et al. (1944) “Social Intelligence Test” might be classed as a 
measure of empathy. However, existing research suggests that it is 
somewhat disguised measure of verbal intelligence and not much more. 


TABLE I 


VARIETIES OF STIMULUS INFORMATION USED IN МАК 


Photographs 
?. Motion pictures (sound or silent) 


3. Unobserved or hidden observation of live 
mirror) 


1. Face-to-face interactions (from 
such as in work, social 
5. Tape recordings ( 


bchavior (such as through one-way 


brief. interviews through long-term | associations 

‚ neighborhood Broups, or marriage) 

using only auditory cues) 

6. Test materials and scores (IQ, MMPI, 

7. Written material (biography, 
paragraphs, etc.) 

8. Personal productions (handwriting, 


9. Past experience with people in a culture (where the judge is predicting local. 
regional, or national Sroup norms, such as estimating Percentage of population 
responding “уез” to a Particular Personality test item) 

10. Any combination of the above 


Rorschach Protocols, etc.) 


autobiography, interview typescripts, descriptive 


» drawings, etc.) 


TABLE II 
TYPES or Јорсікс INSTRUMENTS 
Trait-rating Procedures (; 
24, Postdicting real life 


adjective checklists, Likert-type rating 

behavior (usually using a true-false 

3. Postdicting responses to specific test items (such as an 
Interest Blank, etc.) 

d: Postdicting 
etc.) 


ales, etc.) 
or multiple choice format) 
MMPI, Strong Vocational 


scores on achievement, intelligence, and personality tests (MMPI, IQ, 


5. Postdicting theoretical constr 


ucts (such as 
6. Writing free 
7 


global descriptions of Social o 
Using matching techniques (where 2 to x stin 
sets of data). Thus one might read person 
to match up 10 Rorschach Protocols to th 

8. Ranking procedures (2 to x sti 
possession of trait or variable) 

9. Forced-choice procedures. These are 


a psychiatric diagnosis) 
bject being judged 
mulus persons 
ality description: 
ese) 
mulus persons are 


are matched on 2 to x 
s of 10 people and try 


rank ordered on degree of 
infinite in 


involve choosing which of two statements 
with 


variety, but for 
a pair of “others” 


mple, might 

agrees or disagrees 

10. Q sort (Here a group of persons can be 
trait or one person may have 50 v. 
like" him) 


П. Any combination of above 


Q sorted wit 
ariables Q sorted 


h regard to one personality 
from “most like” to “least 


Most researche: tended in the past to d 


ing tests in the absence of any valid procedures 
publishers. These are usually simple in nature 


€velop their own judg- 
available through test 
and tailor-made to the 
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needs of the study. They involve such things as predicting ratings on 
traits, responses to personality test items, or the like. 

Why the conflicting findings in the literature? Analysis of the 
hundreds of interpersonal perception studies published in the past two 
decades reveals that the kinds of stimulus information given to judges 
vary very greatly in type, quality, and quantity. Table I lists a few exam- 
ples of these. In addition, the choice of judging instrument or procedure 
can vary from simple rating instruments to elaborate Q sorts, as demon- 
strated in Table 11. When we observe the varieties of sources of criterion 
information (Table Ш), and the different possible scoring procedures 


TABLE III 
Sources OF CRITERION INFORMATION 


1. (a) Sclf-provided information. This is where one uses the self-ratings, self-Q sorts, 
test scores, or other information coming from the person one is judging, as the 
criterion, 


(b) Group responses to ratings, variables, personality test items, etc. are predicted 
by judge. This becomes a measure of stereotype accuracy. 

& Associates (peers, subordinates, spouse, teacher, boss, acquaintances, etc.) provide 
criteria in form of ratings, evaluations, Q sorts, or other types of information 

. about the person. being judged 

8. Experts (where psychologists, social scientists, therapists, personnel managers, or 

other "experts" provide ratings, diagnoses, Q sorts, etc.) 

Combinations of above 


available (Table IV), it is easy to understand why searches of the "judg- 
ing literature" reveal so many apparently contradictory findings. We are 
dealing with an area of such extreme complexity that experimental 
methods to determine judging accuracy are seldom identical, and gener- 
ally varied. There does not exist as yet a single, adequate, demonstrated 
lest of judging accuracy for either global- or component-type measures, 
though some valiant efforts have been made to produce such an instru- 


ment or instruments. 

Writers who have argued that there is no general ability to judge 
Others for empathy, intuition, or a general trait, etc. have invariably cited 
the fact that the findings of judging studies show considerable inconsis- 
tency, While this is a legitimate argument, it could be applied to many 
Other areas of psychological research. The reasons for conflicting findings 
are complex, and cannot be referred to single simple factors, | 


ПІ, Cronbach’s Components and the End of Naive Empiricism 


| In 1955, Ronald Taft reviewed the psychological literature. in the 
area of interpersonal perception. He appraised and evaluated several 
Score of the most significant studies to that date. This was, at the time 
4 definitive and major assessment of the literature. However, later in ie 
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TABLE IV 
SCORING PROCEDURES 


l. Number of correct predictions (where multiple-choice, accuracy of diagnosis, rank 
order, true-false, forced-choice, matching, or paired-comparison-type formats are 
used) 

2. Difference score or “D2 statistic," where rating scales are used; in its 


the predicted rating is subtracted from the crit 
give 


simplest form 
erion rating on a “l to 5 scale" to 
a D or difference scorc. These can be summed over all traits or they can be 
first squared and then summed. This can also be broken dow 


n mathematically into 
a number of components (see Cronbach, 1955; Cline and Richards, 1960, 1961b, 
1962) 


3. Correlation statistic (where one correlates a number 


judge against criteria or correlates one у 
criteria, Partial correlation technique 

4. Quantified evaluations of “free glob 
done by experts 


of predictions made by onc 
ariable for a number of judges against 
would be one form of this 


al written description or responses," usually 


Special scoring procedures. These can be infinite 
the literature will be cited 


(a) Crow (1954) has developed a 
Differential Accuracy w 
it is distribution-fre 
about several "others," 


К { es from 
in variety but two examples from 


"Random-Comparison" method for determining 
hich differs from Cronbach's method also discussed in that 
or nonparametric. Whenever 


the judges make estimations 
a distribution of comparisons is generated by comparing 
every estimation with every "other's" Score, ignoring signs. Thus if cight judges 
each made an estimation for each of ten “others,” the comparison procedure would 
yield 800 difference scores. The difference score obtained by comparison of the 
judge's estimation with the score of the “other” for 
intended can be compared у. 
If this difference score 


whom the estimation was 
ith the median of the random comparison distribution. 


is more accurate (lower) than the median of the random 
comparisons, then the judge has obtained Differential Accuracy with that “other. 


The final score is the Percentage of "others" upon which the judge was more 
accurate than the median of the random comparison distribution 

(b) Bronfenbrenner et al. (1958) developed the “Method of Differential Compari- 
son” as a measure of judging accuracy. They believed this to be independent of 
the judge's similarity to the person judged and sensitivity to the 

(Stereotype Accuracy). Any tendency to "assume similarity" 
score. It was obtained by computing the c 


by expressing each particular judging estimate as a deviation (xj) from the mean ot 
the estimates made by the judge for all "others" on that item, Each 
was expressed as a deviation €) from the mean rating 
that item. Computation of the simple correlation (G 


generalized “other 
could not affect this 
orrelation on a within-item basis; that 15, 


criterion rating 


items and "others" for the particular judge < of the judge's ability 
to recognize individual differences among subjects in their response to each item. 
It is also an index of the judge's ability to estimate correctly the relative position 
of self-ratings made by the various “others” on a Particular item 

6. Nonaccuracy measures. These include such things as Elevation, Differential Ele- 
vation, variance of ratings, etc. which do not measure accuracy of judgment per Зе 
but may be quite important in interpreting and understanding the accuracy scores 
obtained 
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year several papers were published (Cronbach, 1955; Gage and Cronbach, 
1955; Campbell, 1955; Hastorf, Bender, and Weintraub, 1955) that seri- 
ously challenged nearly all previous research in the area of judging accu- 
racy. Using logic, mathematics, and analyses of studies, it was demon- 
strated that most judging experiments in the literature had serious flaws 
which rendered them largely uninterpretable. 'There had been earlier 
Critical studies (Cohen, 1953; Cronbach and Gleser, 1953; Hastorf and 
Bender, 1952; Leytham, 1951), and others have appeared since (Crow, 
1957; Crow and Hammond, 1957; Gage, Leavitt and Stone, 1956). How- 
€ver, Cronbach's (1955) paper provided a major methodological critique. 
Using the simple D? statistic (i.e, the arithmetical difference between 
judgment and criterion ratings, squared), he found that this apparently 
Straightforward measure of judging accuracy contained a Pandora's box 
of "components, artifacts, and methodological problems" which would 
vitiate many previous research results. Cronbach demonstrated that “‘dif- 
ference scores” on trait ratings could be analyzed mathematically into 
four component parts some of which could be reduced further into vari- 


ance and correlation parts. 


A. CRONBACH's COMPONENTS 

1. Elevation (E) 

"This component measures the difference between the (a) grand mean 
of an individual judge's predictions for all "others" judged on all items 
and (b) the grand criterion mean. This tells us primarily something about 
the way in which a judge uses the rating scale and does not deal with 
Accuracy per se. 


2. Differential Elevation (DE) 

This measures the variance of the individual judge's ratings. It 
Measures the extent to which the judge can predict the deviation from 
the grand mean of the mean of individual “others” taken over all traits. 
This score can be broken down into a correlation term (which represents 
the judge's ability to judge which "other" rates highest on the elevation 
scale) and a variance term which measures the extent to which judged 
differences between “others” are large or small. 


Ji Stereotype Accuracy (SA) 
This is a measure of what Bronfenbrenner has called “Sensitivity 
to the Generalized Other,” or the degree to which a judge can predict 


how the whole N of “others” responded to a set of items. The predicted 
i à 
tem means averaged across persons are compared with the actual cri- 
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1 s relati rm which 
terion item means across persons. This provides a correlation term wl | 
i i i Н { i anc 
gives the relationship between a Siven set of predicted item means : 
iter ‘ "i and а variance term 
the actual criterion means (of all “others judged) and a variance te 


+ . wie, B 2 s" 
which is an index of the variability and/or complexity of the judge: 
stereotype. 


4. Differential Accuracy (DA) 


This measures the ability 
between "others" on each trait 
another way, this measures the c 
on individual items in Ше jud 
where in each case the "other's" 


of the judges to predict the differences 
or item considered separately. To put is 
lifference between the scores for “others 
ging matrix versus the criterion matrix 
score is the deviation from both his own 
mean and the item mean. Thus this can be derived mathematically by 
subtracting E, DE, and SA from the trait rating total score. It also has а 
correlation and variance term, This is considered a "pure" accuracy 
measure by some, even though under Some circumstances it also has flaws- 

Cronbach (1955) concluded that, 
been dominated by simple operation 
accuracy). Analysis has shown that 
thereby conceal important y 
components. Only by a с; 
investigator hope to know 
mented, "Investigators have been attracted to this 
in such constructs as transference, identification, 
and insight. We must question whether these terms 
in future interpretations of social perception experir 
which looks like ‘projection’ can arise 
there is little point in trying to formulate hypotheses using the concept 
‘projection.’ Instead, theories of perceptual response should take into 
account the traits being perceived, the constant tendencies in this për 
ceiver (judge) with respect to those traits, and finally the effect of the 
particular ‘other’ as a social stimulus to this Perceiver. Worthwhile 
hypotheses of this order cannot be formulated until data from explora- 
tory studies have been treated analytically (Le, mathematically broken 
down into component scores) z won @ single global index (ог judging accu- 
racy) can now be replaced with à series of components of the perceptual 
relation and a separate analysis. can be made for each. Even the simplest 
examples make it evident that insight, assumed similarity, and the other 
global phenomena are the shadow not the substance of social perception." 

In recent research by Cline and Richards (1960, 1962), and Richards 
and Cline (1963) it was demonstrated that the system used in 


“Social perception research has 
ally defined measures (of judging 
any such measure may combine and 


ariables or may depend heavily on unwanted 
areful subdivision of globa 


1 measures can an 
what he is dealing with.” 1, 


ater (1958) he com- 
area by their interest 
projection, empathy, 
should be employed 
ments. If a behavior 
out of many different processes, 


assigning 
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numerical scores to the characteristics (traits) included in the judging 
matrices could produce artifacts tending to reduce or, in extreme cases, 
to eliminate Stereotype Accuracy and Differential Elevation correlation 
terms. Cronbach's Differential Accuracy in certain circumstances is thus 
made diflicult or impossible to interpret. These artifacts were seen as 
particularly objectionable in the case of Differential Elevation and Dif- 
ferential Accuracy. To overcome these difficulties, Cline and Richards 
(1962) and Richards and Cline (1963) proposed a new measure, Inter- 
personal Accuracy (1А), which appears to be the linear combination of 
Differential Elevation and Differential Accuracy but is free of the afore- 
mentioned artifactual effects of the scoring system. This new component 
is discussed in detail later in this paper. 

" Other investigators have become increasingly chary of naive empir- 
lcism in interpersonal perception research with the omnipresent danger 
of misinterpretation of results. A good example of this is Leytham's 
(1951) review of Taft's (1950) study of the personality correlates of judg- 
Ing ability. In Tafts research, using 40 graduate students, he studied 
judging ability by requiring each subject to rate a number of his fellow 
assessees, with whom he was familiar. Six personality variables were rated 
Оп a conventional five-point scale. Each judge also predicted how his 
peers would rate him. The criterion was the consensus of experts (research 
staff) and peer ratings. The more accurately the judge rated others and 
Predicted self ratings, the better judge of personality he was assumed to 
be. The next step was to find test and rating correlates of judging 
ability as thus defined. Leytham later pointed out a statistical artifact, 
Le, that cautious judges (who made many "three" or average ratings) 
Were given a major advantage over those who tended to use the more 
eXtreme ratings. If judging accuracy were computed by the sum of the 
differences from the criteria consensus ratings then the rater who gave 
Nothing but “three” or average rating would be easily the “most accurate" 
judge, because he could never obtain a discrepancy or error score greater 
Шап two (i.e, 5 — 8 = 2 or 8— 1 = 2). Since the criteria consensus rat- 
ings (а summary mean score) tended to regress toward the mean of three, 
this amplified the spurious advantage already given to the cautious 
Judges. Thus when Taft correlated his judging accuracy scores with other 
Outside measures, he found a relationship between them and a "response 
Set" to judge cautiously, but not necessarily "accurately." 

Cronbach (1955) has commented on this point: "If two diagnosticians 
can each judge some trait with the correlational validity of 40, the one 
Who differentiates strongly (i.e., makes extreme statements) will make far 
More serious absolute errors than the one who differentiates moderately. 
Indeed, the person who makes extreme differentiations based on a validity 
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of .40 may have larger errors than a judge who has zero correlational 


validity but gives the same estimate for everyone.” 

In another example, Dymond, Hughes, and Raabe (1952) had sixth 
grade pupils rank each other sociometrically 
of ratings they (the raters) receiv 
accuracy of judgment or estimati 


and also predict what sort 
ed from the others. They found that 
on correlated .50 with sociometric ac- 
ceptance. However, the students generally tended to predict that they 
would be highly accepted. This meant that those students who, in fact, 
were highly accepted automatically obtained higher accuracy scores. This 
fact alone is sufficient to account for the apparent relationship between 
judging accuracy and sociometric acceptance. 

In a study by Nagle (1954) 14 supervisors were each asked to predict 
the number of their own subordinates responding in a favorable direction 
to some morale items having to do with their attitudes toward the 
supervisor. Accuracy of prediction was obtained for each supervisor by 
summing, over the items, the differences between the supervisors’ pre- 
dictions and the in-fact responses of the subordinates. For each one of 
the supervisors’ groups a summary morale score (based on subordinate 
attitudes toward the supervisor) was obtained from this same instrument. 
A correlation of .90 was found between supervisors’ empathic accuracy 
and the morale index. Nagle thus concluded that empathy, or judging 
accuracy, was related to “morale,” as defined in this study, and his data 
apparently supported this conclusion. However, a more parsimonious 
explanation would be that most supervisors predicted highly favorable 
attitudes toward themselves by the men in their group. Thus the super- 
visor whose group morale (as indicated by the favorable attitudes shown 
toward him by his subordinates) was highest would have to receive the 
highest empathy or judging score. Spuriously high scores on empathy 
measures will be made by the truly popular subject when his empathy is 


tested by ability to assess his own popularity; а common human response 
set toward rating oneself as popular will Buarantee this even in the most 
imperceptive of popular people. 


B. Tue RS-AS-ACC PROBLEM 


Various investigators hay 


e been concerned with 
problem. In this case, 


the RS-AS-ACC 
a = the judge's self-description on a trait y 
b = the "other's" (person being judged) $ 
c = the judge's prediction of b. 


ariable, 
elf-description, 


Real Similarity (RS) is where a — p, Assumed Similarity (AS) is 
where a = c, and Accuracy (ACC) is where b = ¢. Many social perception 


INTERPERSONAL PERCEPTION 281 


studies have gone awry because of the lack of understanding that while 
any two of these relationships are independent, the third remaining rela- 
tionship is directly and mathematically linked to and a resultant of the 
other two. Hatch (1962) has commented that, “Where each of a number of 
judges possesses high RS with their others (persons being judged) their 
accuracies will correlate perfectly with the degree to which the judges 
assume similarity with their others. A typical situation is one in which 
each of the judges predicts with a high degree of assumed similarity. The 
accuracy score then varies directly with the amount of real similarity 
existing between the judges and their respective others. This latter rela- 
tionship, real similarity, may best be viewed as a fixed constant which 
exists between a judge and a specified other, as a function of the sampling. 
With the high degree of assumed similarity often found in empathy 
studies (especially where judges predict for some variable allowing self- 
flattering predictions), the resulting accuracy scores bear a highly signif- 
icant relationship to this original constant, RS. Such accuracy scores 
certainly do not represent novel emergents reflecting on empathic sensi- 
tivity of some kind. They are simply the logical consequences of RS-AS- 
ACC linkages and are impossible to interpret.” 

An example of an “RS-AS-ACC study” is Smith’s (1957) study of 
rapport and social perception in mother-adolescent relationships. Some 
15 different judging scores were derived from the four types of data listed 
below. 


(a) Daughters took Kuder Preference Record for themselves; 

(b) Mothers predicted their daughter's responses on this Kuder test; 
(c) Mothers predicted typical adolescent responses to the Kuder test; 
(d) Mothers took Kuder for themselves. 


The following 15 scores were derived from this information. 

(1) Accuracy (ACC): Number of items on daughter's Kuder which 
coincide with mother's predictions for her. 

(2) Real Similarity (RS): Number of items on which mother's self- 
Kuder and daughter's self-Kuder coincide. 

(3) Real Dissimilarity (RD): Number of items on which mother's 
and daughter's self-Kuders differ. 

(4) Assumed Similarity (AS): Number of items on which mother's 
self-Kuder and mother's prediction of her daughter's Kuder coincide. 

(5) Assumed Dissimilarity (AD): Number of items on which mother's 
Self-Kuder and mother's prediction for her daughter differ. 

(6) Warranted Assumed Similarity (WAS): Number of items on 
Which mother's self-Kuder, daughter's self-Kuder, and mother's prediction 
for her daughter all coincide. 
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(7) Warranted Assumed Dissimilarity (WAD): Number of items or 
which mother's self-Kuder does not agree with daughter's self-Kuder, but 
does agree with mother's prediction for her daughter. 


(8) Unwarranted Assumed Similarity (UAS): Number of items ar 
which mother's self-Kuder agrees with her predictions for her daughter 
but not with the daughter's actual self-Kuder responses. Н 

(9) Unwarranted Assumed Dissimilarity (UAD): Number of items 
on which the mother's sel-Kuder coincided with her daughter's self- 


Kuder but not w of her daughter's Kuder 
responses. 


(10) Stereotype Accuracy (SA): Number of items on which mother's 


prediction of a Lypical adolescent Tésponse on Kuder coincide with her 
daughter's self-description, or 


responses, on the Kuder. И 
(11) Stereotype Inaccuracy (SI): Number of items on which mother's 
prediction of the typical adolescent response differs from her daughter's 
self-description or responses on the Kuder. 
(12) Assumed Conventionality (АС): 
mother's prediction of a typical 
of her daughter's Kuder response: 


(13) Assumed Unconventionality (AU): Number of items on which 


mother's predictions of a typical adolescent response and a prediction of 
her daughter's Kuder response differs. 


(14) Warranted Assume. 


ith the mother's predictions 


Number of items on which 
adolescent response and her prediction 
5 coincide. 


d Conventionality (WAC): Number of items 
where mother’s prediction of typical adolescent responses, her daughter's 
self-description, and the mother's prediction of the daughters Kuder 
responses all coincide. 

(15) Warranted Assumed Unconventionality (WAU): Number of 
items where mother's prediction of typical 
with the mother’s prediction of her d 
agree with her daughter's actual sell- 

Smith found low positive correlations between 
offspring rapport and judging accuracy scores. 
suggested that this accuracy was due ir 
rapport judges (mothers) to assume similarity and 
conventionality. But because of the methodology о 
could be presented showing how much of the 
attributed to these artifactual processes. 

Cronbach's (1955) previous analysis of the components of 
scores emphasized the “difference score” already descril 
is the correlation between predicted scores a 


adolescent responses agrees 
aughter's Tesponses, but does not 
description on the Kuder, 

mother-adolescent 
However, the evidence 
the tendency of high- 
also Possibly to assume 
f the Study no evidence 


accuracy variance could be 


judging 
alternative 
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method defines predictive accuracy as the ability to vary one’s predictions 
as the actual situation varies. The ‘difference score’ method defines pre- 
dictive accuracy as the ability to approximate the actual situation. By the 
‘difference score’ method a subject (judge) is penalized for a systematic 
error in estimation of the magnitude of the actual situation. By the cor- 
relation method the subject (judge) is not so penalized. Conversely, a 
subject (judge) is penalized by the correlation method, if although he has 
approximated the actual situation, his predictions do not vary concomi- 
tantly with the actual scores.” 

Thus, in the correlation method such response sets as Cronbach's 
Elevation or those associated with the variability of prediction and re- 
sponse are removed, though the RS-AS-ACC linkages remain. It should 
be noted that a correlation approach can require a judge to predict for 
many "others" on a single variable, IQ for example, or it may require 
him to make predictions [or a single "other" on a large number of 
Separate trait variables. In either case methodological complexities re- 
main which can render judging accuracy scores suspect. For es ample, 
special problems in interpretation arise in the case where a correlation 
is calculated between criterion and a judge's predictions of the self-score 
of each of a number of “others” on a single variable. Here, as Hatch (1962) 
has pointed out, there is a tendency for the judge to assume similarity 
(defined as perceiving others as similar to oneself). This must be con- 
sidered as a response set and this will differ from “other” to “other” ap- 
parently as a function of the judge's generalized attitude toward the 
“other.” 

The tendency to assume similarity is fairly constant over preferred 
“others.” However, knowledge of the degree of assumed similarity be- 
tween a judge and a liked “other” provides no basis for predicting the 
degree of assumed similarity which will be exhibited by the judge for a 
disliked “other.” 

Gage and Cronbach (1955) have noted that assumed similarity is 
highly general over items, and people tend to assume similarity to the 
same degree throughout a questionnaire despite marked variety in the 
item content with this tendency being somewhat general for preferred 
"others." Thus, a judge's attitude and set toward an “other” will in- 
fluence the AS score. 

Hastorf and Bender have tried to clarify and solve the problems 
associated with ACC, or accuracy, which they call “raw empathy,” by 
subtracting AS from it to give a “refined empathy” score. The major 
problem here is that they lose important data when they do this because 
on of ше subtracted А5 Чага represent true accuracy of judgment. 

age and Cronbach (1955) in an excellent discussion of this note that 
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spurious relationships can occur wherever the "other's" self-score cor- 
relates with the “outside” variable, which is to be paired in a correlation 
with the judge's accuracy scores. They suggest that researchers test for 
the presence of spurious or artifactual relationships between the accuracy 
and empathy scores on the criterion variables by checking the relation- 
ship between the “response to be judged” and these same criterion 
measures. An example is given to further illustrate this point. Employees 
are asked to rate their department supervisor and he in turn predicts 
their ratings on а 140-5 point scale. If every supervisor predicts that his 
group will rate him 5 (very good), as might be expected, then there will 
be an automatic perfect correlati 


on, 1.00, between the supervisors' "em- 
pathic ability" 


and receiving high ratings as a supervisor. 
C. OTHER SOURCES OF ERROR IN JUpGiNG-AcCURACY Sconrs 

To give the reader an even more det 
kinds of biases and response sets, etc., 
well as "chance success" 
below. 


ailed and clearer idea of the 
which can add error variance as 
to judging-accuracy scores, a few will be listed 


(1) Social desirability bias: This refers to the tendency for judges 


usually to predict the most socially desirable response in making judg- 
ments and predictions about “others,” 

(2) Similarity of the judge to the “other”: 
personality, religion, occupation, class lev 
background. This in turn m 
like himself, and respond to 
to (9) below. 


(3) Acquiescence set: This refers to a tendency to agree with or pre- 
dict “Yes” to items rather than to choose the negative or “No” responses. 

(4) Use of stereotype: Wherein the judge uses an undifferentiated 
stereotype of “fraternity men,” “Jewish women,” etc. to predict for specific 
“others,” and only if his "other" fits the stereotype will he achieve ac- 
curacy. 

(5) Personal reactions: Liking or disliking the individual being 
judged can produce a “halo” effect in rating or judging. 

(6) Making use of an implicit personality theory: \ 
assumes there is an invariant relationship between tr. 
the "other") and traits b,c,and d (not observed but assumed correlated). 

(7) Tendency to make extreme ratings or Judgments: This is a re- 
sponse set wherein the judge overdifferentiates in his ratings of “others,” 

(8) The central tendency response set: Wherein the judge sees most 
“others” as being very similar or alike and gives mostly middle range or 
average scores with very little differentiation from “other” to “other.” 


This may be in sex, age, 
el, ethnic, social, and cultural 
ay lead the judge to believe that “the other” is 


the items as if this were so. This is very similar 


Vherein the judge 
ait a (observed in 
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(9) To assume similarity or project: In this case, the judge sees 
"others" as being very similar to himself (as defined by his self-ratings). 

(10) To assume dissimilarity or project “in reverse”: Wherein one 
perceives most “others” as being different from oneself. 

(11) Semantic ambiguities in trait terms: Here the judge interprets 
the trait name in the judging instrument assuming a meaning other than 
that intended in its criterion development and use. 

Mathematical linkages between predictor and criterion variables (as 
already discussed), and the use of “global scores” which contain numerous 
components sometimes working in opposite directions, can also be a 
further source of error. Several investigators (Cline and Richards, 1960; 
Hatch, 1962; Crow, 1954; Bronfenbrenner, Harding, and Gallwey, 1958; 
Grossman, 1963) have attempted to design studies that would meet these 
objections. However, their researches have met with only partial success. 

Grossman (1963) used 5-min movies of adult subjects being inter- 
viewed. Three films were of males and three of females. These sound-color 
movies had been developed previously by Cline and Richards. By show- 
ing three sets of film at a time and using a matching method, it was pos- 
sible for Grossman to control or eliminate the Elevation and Differential 
Elevation components as discussed in Cronbach's 1955 paper. Grossman 
then attempted to eliminate Stereotype Accuracy, the third of Cronbach's 
four components, by the following technique. 

Starting with 240 judging items for the six films, two item analyses 
were undertaken. The first was an internal consistency analysis to choose 
the most highly discriminating “interpersonal sensitivity” items. The 
second involved giving a four-part test on group sensitivity to the same 
subjects (judges). This test was believed to be a measure of Stereotype 
Accuracy. Items retained in the final judging test were highly discriminat- 
ing for interpersonal sensitivity, but nondiscriminating for group sen- 
sitivity or Stereotype Accuracy. The final correlation between the inter- 
personal sensitivity index and the group sensitivity index was 12; OF 
nonsignificant. The final judging test thus derived was believed to be a 
relatively pure measure of Differential Accuracy. This index was cor- 
related with 38 personality, demographic, and other types of variables. 
Grossman found some evidence for generality of judging ability. His 
Cross-sexes correlation when corrected for attenuation was .48. His cross- 
persons correlation when corrected for attenuation was .60. He also 
found that the more observant a judge was, the higher his judging score. 
He obtained a measure of observant accuracy by testing how much the 
judge could remember about the appearance, actions, and behaviors of 
the people in the film. He also found that the better judges tended to be 
more “open-minded” as measured by three subscales, Religious Skepti- 
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cism, Nonconformity, and Liberalism. He also found low positive correla- 
tions between judging ability and intelligence, using ACE Total, Quan- 
titative, and Language scores. 


D. Натсн'ѕ FORCED-CHOICE DIFFERENTIAL ACCURACY APPROACH 


Hatch (1962) tackled the methodological problems of interpersonal 
perception research with great care, going to considerable lengths to 
control bias, response set, and other artifacts. He identified three primary 
difficulties to be overcome in any research of this kind: (1) solving the 
RS-AS-ACC_ problem; (2) controlling for miscellaneous response sets; 
and (3) developing an adequate “better than chance” test for one's 
judging-accuracy results. He first proceeded to develop a Forced Choice 
Differential Accuracy Test which gave a score corresponding to Cron- 
bach's Differential Accuracy correlation term. The judges (30 branch 
managers of Minnesota Mining and Manufacturing Co.) wi 
to predict which one of two statements, both of 
nates disagreed оп. Example (slightly modified): 


ere required 
a pair of their subordi- 


Mr. Brown Mr. Jones 
= — 


Check One 
1. My sales manager assigns first priority to problems brought to him 
by a salesman. 
My sales manag. 


advice. 


oO 


ю 


er should consult me more often for my opinion or 


A large number of subordinates (318) previously responded either 
Strongly Agree, Agree, Disagree, or Strongly Disagree to items in a sales 
attitude questionnaire from which the pairs of men and their responses 
to items had been taken in developing this empathy test. Hatch found 
that any two subordinates might have identical responses (IR) or non- 
identical responses (NIR) to an item, There were three kinds of NIR 
response subcategories: (A) Maximally Dissimilar Responses (MDR) 
where one subordinate ays “Strongly Agree” versus “Strongly Disagree” 
for the other; (B) Dissimilar Responses (DR) of Agree versus Disagree; 
and (C) Similar Responses (SR) of Agree versus Strongly Agree, or Dis- 
agree versus Strongly Disagree. AII three of these were classified as “dis- 
agreed on" (or NIR) responses of the Supervisor's subordinate pair in the 
empathy test. 

Using this Forced Choice Differential Accuracy 
demonstrated empirically that the structure of the forced-choice items did 
not permit a confounding statistical linkage to exist between RS, AS, and 
ACC. Thus, the judge might obtain accuracy under any combination of RS 
and AS conditions. "Accuracy depends only upon the correct identification 


technique, Hatch 
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of the NIR statement and is uninfluenced by the real similarity and high- 
low assumed similarity conditions; thus, judges with any combination of 
RS-AS relationship may be scored as accurate or inaccurate." Hatch ar- 
ranged to control for four specific types of response sets and set up his 
experimental design so as to control “unidentified response sets.” These 
four response sets were identified as “response variability,” “response 
intensity,” “atypical response pattern of judge,” and “atypical response 
pattern of others." The first ("response variability") will be discussed 
briefly for illustration purposes. From the responses of salesmen to the 
original sales attitude questionnaire, it was found that some items were 
controversial, with much disagreement among respondents. With other 
items there was very little response variance. If most of the NIR state- 
ments had high response variance, it would be possible for a judge know- 
ing only this to obtain a high accuracy score by always picking the most 
controversial item, as the empathy test required predicting which of two 
statements the subordinates disagreed on. To control this, statements 
were matched so that the IR statement variance exceeded the NIR state- 
ment variance in exactly one half of the forced-choice items prepared for 
each judge. Elaborate experimental controls in item selection were also 
established for the other three response sets mentioned previously. 

In an effort to assess the practical validity of his empathy test, Hatch 
correlated it with (a) a measure of human relations skill (by having com- 
pany executives nominate branch managers who were most and least 
successful in interpersonal relations with subordinates), (b) degree of 
acquaintance (between branch manager and subordinate), and (c) degree 
of extrapolation or inference required between information input and 
outtake (prediction). The latter item was assessed by comparing judges’ 
accuracy on each of the three NIR items (SR, DR, and MDR) matched 
with the IR item. Thus, the greatest extrapolation or inference required 
of a judge (and also theoretically the most difficult judgment) would be 
onan IR versus SR item, whereas the easiest would be an IR versus MDR 
item. Each manager made predictions on 12 SR items, 12 DR items, and 
12 MDR items. In this way the degree of measured discordance in the 
attitudes of a subordinate pair was varied for each manager, though held 
constant from manager to manager. The subordinates under each branch 
manager were also classified into three groups: (a) AP (attitude predict- 
able); these were the subordinates that a branch manager felt most con- 
fident in making predictions about and with whom he was most ac- 
quainted; (b) AU (attitude unpredictable); these were the subordinates 
that a branch manager felt least confident in making predictions about 
and was least acquainted with; and (c) AD (attitude divergent); these 


238 VICTOR B. CLINE 


were salesmen at either extreme on a "dissatisfaction index" taken from 
the sales attitude questionnaire. Each judge predicted on 19 items for 
each of the three pairs of subordinates, AP, AD, and AU. 

Each judge (branch manager) received a different 36-item pair in- 
ventory devised individually for him. Because of the highly restrictive 
conditions set in controlling the four response sets in addition to AP, 
AU, AD and SR, DR, and MDR conditions, it was necessary to program 
a "look up and comparison" table using the Univac 1108 computer. Over 
14,000 IBM cards and 20 hours of computer time were necessary to list the 
eligible items in order of priority. Following this, some 40 additional 
hours were involved in selecting the 10,080 forced-choice items from the 
computer printout. Hatch concluded that the costs were excessive and 
that the Forced Choice Differential Accuracy approach, “Is probably not 
a practical solution to the empathic measurement problem.” It was also 
indicated that if the same computer data were used by another investiga- 
tor, precisely the same inventories would not result and, in fact, Hatch 
would not be able to repeat the process if he were, himself, to do it again. 


The final results indicated that empathy as r 


measured here was not 
related to human relations skills but was somewhat related to degree of 


acquaintance between manager (judge) and subordinate as well as degree 
of inference or extrapolation required. While his good judges as a group 


were able to predict at better than chance (P < :05), Hatch noted that 


“much but not all of the prediction of the study could be accounted for 
by chance alone.” 


Because this study represents a major effort to eliminate the methodo- 


logical gremlins which plague interpersonal perception research, it is a 
Significant contribution to the literature. Yet the lack of any relationship 
with human relations skills, as well as the only slightly above chance 
accuracy scores obtained by judges, is of considerable concern to this 
reviewer. Serious questions arise about the potential future of an ap- 
proach where such costly and careful controls yield such modest results. 
E. Accuracy versus Process APPROACHES IN Rx: 
In the person perception literature 
tween research which emphasizes accura 
focuses on process (i.e., how we judge). 
A growing number of researchers have ab 
assessment of judging ability as a trait in f. 
only in a specific instance with a partic 
strument. 
Tagiuri and Petrullo (1958) hav. 
research emphasizing process by the 


ARCH 


a sharp cleavage is apparent be- 
cy of judgment and that which 


andoned work on the 
avor of the view that this exists 
ular person using a certain in- 


€ tried to turn the tide in favor of 
Publication of а major collection 
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of papers contributed predominantly by workers in this group. Tagiuri, 
in the same volume, has stated the case with considerable vigor: "For a 
number of reasons, attempts at studying correlates of accuracy have with 
very few exceptions produced negligible correlations and yielded very 
little insight into processes. First there is no single satisfactory criterion 
against which to match the judgments. The criteria used—objective be- 
havior, self ratings by the object person, ratings by the experts, consensual 
ratings by peers—do not always agree and have very different psycho- 
logical implications. Second, the disparity of tasks and abilities subsumed 
under the various operations called measures of accuracy have been 
glossed over. It is also probable that different judgmental skills may be 
involved in different situations. In addition, most accuracy scores contain 
some seven different and not necessarily correlated components. There is 
furthermore the extreme dependence of results upon judgmental sets and 
upon the distributions of the variables that are to be judged. Finally, 
most of the studies are inconclusive because of the lack of representative- 
ness in the design employed. In sum, investigations yield data that are 
difficult to interpret and impossible to compare. It is the process rather 
than its achievement that one must investigate if a broad understanding 
of the phenomenon is to be reached. This point cannot be stressed suf- 
ficiently. It is also important to realize that the difficulties encountered 
in quantitative studies of accuracy cannot be eliminated by resorting to 
careful qualitative approaches. Such phenomena as real and assumed 
similarity, stereotype and differential accuracy, favorability sets, and 
artifactual relations, to mention but a few, apply to any kind of inquiry 
into person perception.” 

Fiedler (1960) stated that he is not interested in accuracy of judgment 
per se, but only in what kinds of perceptions, regardless of accuracy, are 
related to such behavioral outcomes as winning basketball games, ac- 
curacy of bomber crews, etc. 

However, Crow (1960) has responded to this position by commenting, 
al perception been freed from the pre- 


“Only in recent decades has physic 
Occupation with mediational detail and the stimulus bias that has charac- 
terized its history. In person perception research, we must examine cau- 
tiously any call to focus on process that turns its back on accuracy. My 
point is not that we should ignore process nor should we fail to eliminate 
from our scores those components which invalidate our conclusions. The 
danger lies in the possibility that we will ignore accuracy and that the 
components once eliminated will not receive the attention they require. 
We should study process but we should not become bogged down in the 
technological details of the machinery by which the organism arrives at 
interpersonal perceptions. We must keep in mind that the process we are 
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studying is a process of functional achievement. If we fail to do this in 


person perception research we may engage in a tedious reinactment of 
the history of physical perception." 


IV. Analysis of Judgments from Motion-Picture Samples 


The present author has been conducting a series of research studies 
in the area of interpersonal perception since 1951. Many have been in the 
nature of large-scale pilot studies which have frequently yielded negative 
results and have not been published. However, negative results are fre- 
quently as important and necessary to progress as positive findings and 
both kinds will be reviewed in the following section. 

In preliminary consideration of the literature the writer noticed the 
frequent use of still photographs as stimuli for judgments. While this 
kind of “limited cue" study has its place, the writer is in agreement with 
Bruner and Tagiuri (1954), who commented that "A vast literature arose 
based on judgments of emotions from still pictures and drawings of 
human faces, when we know that it is rarely that one makes a judgment 
based upon a frozen millisecond of exposure to a f. 
with all other forms of information 1 
ventional analysis of expression carrie 
value and understanding of stimulu 
emotion... .” 


ace expressing emotion 
acking—it seems unlikely that con- 
Ч out with callipers will yield much 
5 properties of faces in a state of 


The writer and his collaborators we 
dynamic true-to-life type of stimulus w 
interview (following the work of Estes, 
Luft, 1949; and others) appeared to be 
photographs or drawings of faces, This 


of verbal and visual cues in situations allowing for repeated observation. 
Interview behavior appears also to be more typical of the kind of phe- 
nomenon upon which judgments are made in real life. In early research 
conducted at the University of California by the writer, a dilemma pre- 
sented itself. It was found Оп an experimental basis that 2 hours proved 
to be the maximum period for which most judges could tolerate looking 
at films and fill out judging instruments, at least at one sitting. "Thus, 
one might make a film of an interview lasting 45 minutes to an hour and 
present one or two of these to a group of judges for evaluation, but the 
number of "others," (social objects or persons judged) was extremely 
small and limited, Or as another alternative the experimenter could 
present a fairly large number of I-minute filmed interviews, but so limit 
his judges in the amount of information presented to them that it almost 
approached the still Photographs as a source of data. Thus, in time, it 
became apparent that the filmed interview could not be a typical, un- 


re of the opinion that a more 
as needed. A sound film of an 
1938; Gage, 1952; Giedt, 1951; 
à possible alternative to immobile 
permits the recording of a variety 
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structured, loose, clinical type of interview, but must in as short a time 
possible compress, capture, and present a wide range of information both 
objective and factual in content as well as emotional and reactional in 
nature. 


A. CONSTRUCTION OF FILMS 

Alter some 6 months of experimentation, the following procedure 
was developed. Sound motion pictures in black and white were taken of 
nine male college students using a hidden camera and microphone. Inter- 
viewees initially knew only that they were to report for job interviews in 
their particular fields of study. The interviews were very highly structured 
and were held relatively constant for all those participating. Each inter- 
view had three phases: (1) A standard opening session in which stock 
employment questions were asked by a mature and skilled actor posing 
as a prospective employer; (2) a stress session in which the “employer” 
became critical and mildly stressful in his questioning; and (3) an abreac- 
tion session in which the inteviewer stopped playing the role of the em- 
ployer, revealed the true experimental intent of the interview, and be- 
came quite humble and deferential. He invited comments about how the 
interviewee had felt in such a situation and what had bothered him the 
most. It was at this point that the interviewee relaxed greatly and usually 
became quite verbal and very [rank, indicating points of tension, block- 
ing, embarrassment, and his degree of insight into the total situation. 
This was all recorded by the hidden camera and microphone. Permission 
was then secured to use the films for research. The four best films of the 
nine, representing the most diverse personalities, were presented to large 
populations of judges. 

This employment interview which was in part a stress interview, had 
been suggested by one of the OSS (1948) situational procedures used 
during World War II. In developing this technique for the present re- 
search everything possible was done to compress in 11 minutes a great 
range of information about the individual being interviewed, both ob- 
jective and emotional. 

Three judging tests were constructed. 


1. Behavior Postdiction Test 
Here the judge was required to describe or postdict how the inter- 
viewee behaved in everyday life. A series of questions was set up with 
à multiple-choice format. Example: 
14. When the interviewee (in the film) is in a violent argument usually 


he: 
(a) Becomes very sarcastic, 
(b) Uses profanity and obscene words, 


ho 
He 
ho 


VICTOR B. CLINE 


(c) Leaves the room or area, 
(d) Strikes his opponent with his fists. 


There was only one correct response. The other alternatives were 
were carefully constructed so as not to include even partially correct 
statements. A separate and individually tailored Behavior Postdiction 
Test was constructed for each interviewee. Other items tapped such areas 
as relations with women, behavior at social gatherings, handling money, 
etc. The experimenter knew how each interviewee characteristically be- 
haved, and was able to write items tailored specifically to his personality 
because intensive interviews һай been held with him, and his family, 
close friends, fraternity brothers, and/or fiancee, or wife, where available. 
Only those behaviors which all agreed upon as being characteristic 
were used, and the incorrect alternatives were chosen with equal care 


so as to eliminate even partial or occasional kinds of behaviors. The 


judging score was the total number of correct predictions made for all 
four filmed interviews. 


2. Personality Word Card 


Next the judges were required to predict the interviewees’ “verbal 
behavior.” This was done in the following manner. 
on three previous occasions, about a month apart, filled out the Person- 
ality Word Card, checking those adjectives which they felt best described 
themselves. This card was merely a group of 200 descriptive adjectives 
such as, “persistent,” “anxious,” "aggressive," 

tain adjectives were "marginal," that 
of the three occasions he filled it out 


The interviewees had 


“frank,” “Dossy,” "sexy," 
and so forth. It was found that cer 
is, the $ checked them on only one 
for himself. Other adjectives proved to be more centr 


in nature, being checked as self descriptive two 
of three. These were retained in 


al and important 


and/or three times out 
a final key. Thus, each interviewee had a 
unique pattern of adjectives which represented an important and con- 


sistent part of his self perception. Each judge then was r 

seeing the film, to predict which adjectives had been checked by the inter- 
viewee in describing himself. Each judge was told to check only about 
one third of the adjectives in making his predictions on each interviewee 
because the interviewee keys average one third the total number of 


adjectives. The judging accuracy score for each judge was based upon 


the percentage of correct predictions averaged for all four of the test 
films. 


equired, after 


3. Multiple Choice Sentence Completion Test 


This technique was 


also used but abandoned because of low reli- 
ability. Here the judge 


attempted to predict the sentence completion 
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responses, using a multiple-choice format, for each of the interviewees in 
the films. 


B. PRELIMINARY RELIABILITY AND VALIDITY 


Reliability coefficients for each of the three techniques were com- 
puted by employing the Spearman—Brown formula to the correlation 
between odd and even halves. The Postdiction test yielded a reliability of 
.56. The Personality Word Card yielded a reliability of .83. The reli- 
ability of the Multiple Choice Sentence Completion Test was only .35, 
and for this reason was discarded as an index of judging ability. None 
of these were corrected for attenuation, which would have given them 
higher reliabilities. An equally weighted composite of scores on the first 
two judging tests (Postdiction Test and Personality Word Card) was ob- 
tained for each judge. These two measures intercorrelated .43 + .10. 
In the initial experiment five groups of judges viewed these four films and 
made judgments. After each filmed interview had been shown the pro- 
jector was stopped and they were asked to make their judgments on the 
instruments discussed. These Ss were (a) 109 undergraduate college stu- 
dents, (b) 106 professionals (clinical psychologists, psychiatrists, and 
graduate clinical trainees), (c) 47 adult members of a Protestant church 
congregation, (d) 43 nursing trainees, and (e) 11 advanced engineering 
students. 

The results may be briefly summarized as follows: 
judges performed at very much better than chance success (.001 level of 
confidence) on both the Behavior Postdiction Test and the Personality 
Word Card. The question arose, however, "To what extent would the 
judges have done just as well without seeing the films but merely filling 
in the tests according to their stereotype of a typical college male?" To 
answer this question a special control group of 57 undergraduates filled 
out the forms without seeing any of the films whatsoever, merely using 
hunches or social stereotypes about college males, in responding to the 
various questions. Their level of accuracy was termed “psychological 
chance.” When the control and experimental groups of judges were com- 
pared in their performance the test indicated significant differences, 
better than the .001 level of confidence, on both judging instruments, 
favoring those people who had seen the films. The evidence seemed to be 
fairly conclusive that the judges who saw the films were making fairly 
accurate predictions or judgments on the basis of a differential analysis 
and a real evaluation of the personalities of the films rather than from 
some crude internalized stereotypes of what college males were like. 

This study was conducted in 1952 prior to the publication of most 
of the articles suggesting components, RS-AS-ACC problems, and the 


All groups of 
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like. However, it should be noted that the D? statistic, which later proved 
so fallible, was not used in this study, even though the possibility of other 
"error" or artifacts remains, 

When the judges of the five subgroups were compared for accuracy a 
three-level hierarchy was found. Most 


accurate and proficient were the 
professionals, that is, the psy 


chiatrists, psychologists, and clinical trainees. 
At level two were the nursing trainees, and at level three the college 
students and members of the church congregation as well as the advanced 
engineering trainees. The differences in accuracy between levels one and 
three on both the Postdiction Test and Personality Word Card were 
significant at the .05 level of confidence. The differences between levels 
one and two, and two and three, however, did not quite reach significance. 
It was also found that women consistently obtained slightly higher 
judging scores than men. While these differe 
significance they were consistent in this study 
conducted by the author. It was 


nces never quite reached 
às well as most other studies 
also found that some of the interviewees 
or "others" in the films appeared generally easier to judge than others. 
Length of professional psychological or psychiatric experience was re- 
lated to (a) greater accuracy in using the Personality Word С 
predicting “verbal” behavior, and (b) decreased 
real life social behavior on the Behavior P 
was based upon a comparison of the 
psychiatrists and Psychologists having 
with the accuracy of 


tard, that is, 
accuracy in predicting 
ostdiction Test. This finding 
accuracy of predictions made by 
more than ten years experience 
predictions made by those having less than three 
years experience. The relatively new and less experienced clinicians were 
a little less accurate in predicting verbal behavior (P= .10), but were 
considerably more accurate in predicting real life behavior (P = .02) 
than were their much more experienced and “older” 

suggested that there may be danger with increasing experience in becom- 
ing more sensitive and aware of verbal behavior but less aware and less 
in contact with reality with real life social behavior. The top and bottom 
quartiles in judging ability of the college sample filled out Gough's 
Adjective Check List describing themselves, The better judges signifi- 
cantly more often characterized themselves on this check list as "sym- 
pathetic" and “affectionate.” The adjective self checks characterizing the 
poorest judges included “dissatisfied,” “irritable,” “awkward,” 
and “hurried.” 


colleagues. This 


“praising,” 


With regards to the personality correlates of judging ability, as here 
defined, results were available for the sample of 109 college under- 
graduates. They were given a battery of tests which included the MMPI 
scored for 22 scales, the California Public Opinion Scales (including Е, 


F, and PEC), the Henmon-Nelson Intelligence Test, the Gough Adjective 
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Check List, the Brunswik Faces Test (wherein one estimates the in- 
telligence, likeability, and so forth of 46 males on the basis of small still 
photographs of their faces); the Barron-Welch Art Scale, the Group 
Opinion Estimate (where one attempts to predict what percentage of 
the general population responds true to each of 30 MMPI items, such as 
"I pray several times a week"), and the Klein Social Prediction "Technique, 
wherein one is given a transcript of a patient's remarks in a clinical 
interview that is interrupted at a crucial point, and five possible endings 
are presented, only one of which is correct (the task here being to predict 
the correct alternative). 

In summary it was found in this study that the most significant 
correlates of judging ability were (a) absence of ethnocentricauthoritarian 
attitudes (Fascism and Ethnocentrism correlated —46 and —.32), (b) 
superior intellectual ability, (c) lower scores on MMPI scales for 
Hypochondriasis, Dissimulation, Paranoia, Schizophrenia, Psychopathic 
Deviate, Prejudice, and F (validation), and (d) higher scores on the MMPI 
Social Status and Intellectual Efficiency measures. When correlations were 
run between judging ability and these various tests, for the sexes sepa- 
rately, some male-female differences emerged, which are discussed in 
greater detail elsewhere (Cline, 1955a). Sex differences in interpersonal 
perception have been found in nearly all studies conducted by this in- 
vestigator, indicating the necessity of separate analyses of data for male 
and female judges, and probably for "others" when they include both 
sexes, 


C. Tur GENERALITY OF JUDGING ABILITY 

Cline and Richards in 1957 began a new series of studies at the 
University of Utah in the area of interpersonal perception. The initial 
focus was on the problem of generality and its relation to a component 
type of analysis of global accuracy scores. A new set of filmed interviews 
in sound and color as well as much larger and varied group of judging 
instruments was developed. These instruments included 

(1) Behavior Postdiction test. The development and format of this 
were similar to the same test cited in the previous study. 

(2) Sentence Completion test. This is very similar to the previously 
discarded Multiple Choice Sentence Completion test described previously. 

(3) Adjective Check (ACL). This evolved from the older previously 
Mentioned Personality Word Card. In this case the judges were required 
to predict which of 20 adjectives the interviewee (or “other”) had 
Previously checked as being self descriptive. Only those adjectives which 
the interviewee had consistently checked on three different occasions as 
applying to himself were used as the “correct” items, and only those 
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adjectives never checked as self descriptive were used as the incorrect 
alternatives. Ten of the 20 adjectives were "correct" and the other ten 
"incorrect." In choosing the "correct" and "incorrect" adjective items 
for this judging test from the original 300 adjective pool, an attempt was 
made to choose socially desirable items as well as undesirable items in 
about equal numbers as correct and incorrect on the scoring key. | 

(4) Opinion Prediction test, Here the judge was required to predict 
how the interviewee (in the film) answered 20 MMPI items such as, “ ‘I 
fall in love easily, True or False?” Only those MMPI 
interviewee on three different occasions answered consistently the same 
way were used. It will be noted that each of the 25 interviewees had 
previously filled out answers to a selected group of 80 MMPI iterns on 
three occasions (several weeks apart). It was from this 80-item pool that 
the 20 consistently answered items were selected. An 
obtain ten of these items scored in the True dire 
False direction and, additionally, th 
responses correct and ten 


items which the 


attempt was made to 
ction and ten in the 
at ten should have “favorable” type 
“unfavorable” correct to prevent the judges 
from obtaining high accuracy scores merely by going through and check- 
ing only the “socially desirable” (or favorable) responses, Since this scale 
маз constructed on the basis of each interviewee's responses, the particular 
items included in the scale differed from one interviewee to another. 

(5) Trait Rating test. Here the judge was required to rate the 
interviewee on 50 traits (later reduced to 25) such as “cooperative,” 
“impulsive,” “efficient,” “a leader,” etc., on а six-point scale ranging 
from “very like" to “very unlike.” The criterion was obtained from the 
mean of the ratings (on each of the items) of the interviewee by his 
friends and associates, Two other secondary types of criterion scores were 
also obtained: (1) the researcher's final summary ratings of the interviewee 
after evaluation of all of the test and interview data; and (2) the mean of 


the interviewee's own self ratings, done three times, This Trait Rating 
procedure was amenable to a Cronbach-type analysis into components. 

Various revisions in scoring systems and format occurred with all of 
these instruments over time and a number of other instruments were 
developed, some of which are men 


З tioned later in this paper. 
While the problems in interpreting global scores of any kind were 
recognized, the different judging 


instruments were 
results indicated in Tables V through XIIL2 In 


intercorrelated with 
addition, correlations 


ation to the following journals for per- 
lowing tables: Journal of Abnormal and 
Journal of Applied Psychology, Tables 
i апа Psychological Reports, Tables XVI- 
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were obtained between single instruments and Total Judging scores. In 

this case, Total Judging scores were obtained by converting scores on 

each judging instrument to standard scores and adding across instru- 

ments. All correlations between individual instruments and Total Judg- 
TABLE V 


INTERCORRELATIONS AMONG JUDGING MEASURES FOR UNIVERSITY ОЕ UTAH STUDENTS 
Usne THE SIX- AND TEN-FILM Forms 


(N — 295) 
Ten-film forme Six-film forma 
Trait Adjective Total Trait Adjective Total 
Rating Check Judging? Rating Check Judging? 
Behavior Postdiction 24 04 31 36 28 39 
Trait Rating 31 44 41 50 
Adjective 31 44 
r 05 = ЛІ r 01 = 15 


а Note: Through procedures resembling item analysis, the number of films used 
during a judging test was eventually reduced to the "best six." And unless otherwise 
specified in the tables, it is this group of films which was used. Where the “10-film 
form," “20-film form,” etc. are referred to it means that the particular research was 
performed before the final "best six" films had been selected. 

v Part-whole correlations corrected for the contribution of the part. 


TABLE VI 


INTERCORRELATIONS BETWEEN ACCURACY SCORES ON 
UNIVERSITY OF UTAH STUDENTS 


DIFFERENT JUDGING MEASURES FOR 


(М = 51) 
Trait Opinion Adjective Total 
Rating Prediction Check Judging 
Behavior Postdiction 216 20 57 37 
Trait Rating 31 13 28 
Opinion Prediction 25 32 
Adjective Check 36 
r 05 = 27 r 01 = 35 
a Part-whole correlation corrected for contribution of the part. 


TABLE VII 
INTERCORRELATIONS AMONG JUDGING MEASURES FOR 
Inano STATE (MENTAL) Hosprrat PATIENTS 


(N = 32) 
Trait Adjective Total 
Rating Check Judging? 
Behavior Postdiction Al 61 67 
Trait Rating al 37 
Adjective Check 57 
т.01 = .45 


т 05 = .35 


а Part-whole correlation corrected for contribution of the part. 
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ing scores were part-whole correlations corrected for contribution of the 
part. In Table XV are given some of the reliabilities of the judging 
instruments. In Table XIV are listed the correlations between Trait 


TABLE VIII 
INTERCORRELATIONS AMONG Jurcic Measures ron T Group 


MEMBERS 
(N = 118) 
Trait Opinion Adjective Total 
Rating Prediction Check Judging? 
Behavior Postdiction 28 36 36 52 
Trait Rating 27 .03 26 
Opinion Prediction 07 AT 
Adjective Check 35 
r05—.17 r 01 = 23 
в Part-whole correlation corrected for contribution of the part. 


TABLE IX 
NS AMONG JUDGING MEASURES FOR University ор UTAH STUDENTS 
(N = 50) 


Trait Behavior Sentence 


INTERCORRELA TIO; 


Opinion 
Rating Post- Com- Pre- Adjective Total 
Measure Total diction pletion diction Check — Judging? 
Trait Rating Total — 
Behavior Postdiction 30 — 
Sentence Completion AB 47 — 
Opinion Prediction 52 50 47 — 
Adjective Check 65 24 58 54 = 
Total Judginge 63 44 63 65 66 = 
r 05 = .27 


r Ol = 35 
@ Part-whole 


correlations corrected for the contribution of the part. 


TABLE X 


TEAsuREs FOR UTAH STATE (MENTAL) HosPrrAL STAFF 


(N = 62) 
Behavior 


INTERCORRELATIONS AMONG Јороіхс М 


Trait Adjective Total 
Postdiction Rating Check Judging 
Behavior Postdiction — 50 22 42 
Trait Rating — 36 48 
Adjective Check = EH 
Total Judging Score = 
r 05 = 25 r 01 = 33 


@ Part-whole correlations corrected for the 


contribution of the part. 
Rating Total score (where Trait Rati 
instrument susceptible to a Cronbach-type component analysis, and the 
Trait Rating Total score is the global unanalyzed measure), its various 
components, and four other entirely different types of judging instru- 


ng is a difference score judging 
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ments (Behavior Postdiction, Sentence Completion, Opinion Prediction, 
and Adjective Check—none of which were susceptible to a component 
analysis), plus a final Total Judging score, combining scores on all five 
instruments (through conversion to standard scores). 

The conclusions drawn from these studies and data were that the 
evidence indicated that a general, global ability to judge others accurately 
could be meaningfully measured. However, the data in Table XIV 
suggested that this global ability involved a complicated dynamic process, 
or, more precisely, a factorially complex process that helped to clarify 
What was general in the global measure. Stereotype Accuracy appeared to 


TABLE XI 
INTERGORRELATIONS AMONG JUDGING MEASURES FOR UTAH STATE (MENTAL) HOSPITAL 
PATIENTS 
(N = 56) 
Behavior Trait Adjective Total 
Postdiction Rating Check Judging? 
Behavior Postdiction — 41 32 42 
Trait Rating — 30 44 
Adjective Check == 31 
Total Judging = 
r 05 = 26 r 01 = 31 


@ Part-whole correlation corrected for the contribution of the part. 


TABLE XII 
INTERCORRELATIONS AMONG JUDGING MEASURES FOR University or UTAH STUDENTS USING 
20-Еп.м Form 


(N = 23) 
Behavior Trait Adjective 
Postdiction Rating Check 
Behavior Postdiction = 36 09 
Trait Rating = 06 
Adjective Check € 
r 05 = 40 r 01 = 51 


account for a large portion of this generality. However, after this com- 
ponent was eliminated, considerable generality remained, which appeared 
to be related mainly to the Differential Accuracy component. Thus these 
two components were to some degree independent. It appeared that one 
might be an accurate judge because one has an accurate stereotype or 
because one is able to predict specific differences between individuals or 
both. Bronfenbrenner et al. (1958) came to a nearly identical conclusion 
using quite different research procedures. They referred to the two com- 
ponents as "Sensitivity to the Generalized Other" and “Interpersonal 
Sensitivity" (or Stereotype Accuracy and Differential Accuracy in Cron- 
bach's terminology). 


250 VICTOR B. CLINE 


It would appear paradoxical (or inappropriate) to conclude both 
that judging ability is general and that at the same time it consists of 
two relatively independent components. However it is probable that these 
seemingly conflicting conclusions mean that the ability to judge others 


TABLE XIII 
INTERCORRELATIONS AMONG Jupeinc MEAsures ror UNiversity ОЕ UTAH Stu 


NINE-FILM Form 


DENTS USING 


(N = 26) 
Behavior Sentence Opinion Adjec- — Adjec- 
Post- Com- Pre- tive tive Trait Trait 
diction pletion diction Check I Check II Rating I Rating II 
Behavior 
Postdiction — Br] —.20 94 01 —20 06 
Sentence 
Completion = 20 13 44 32 08 
Opinion 
Prediction — 26 42 26 16 
Adjective Check 
I (no. of adjec- 
tives correctly 
predicted) — 41 00 19 
Adjective Check 
П (9 adjec- 
tives correctly 
predicted) — 29 18 
Trait Rating Ia 


(sum of dif- 

ference scores 

where criterion 

is 5 friends’ rat- 

ings of “other” - 50 
Trait Ratinga 

(sum of dif- 

ference scores 

where criterion 

is the "other's" 

self ratings) — 

r 05 = 38 


tions have been changed so that they 


judging i are consistent with other 
Judging instruments. 


accurately is factorially complex and that the i 
reflect this complexity. It is, therefore, still me 
general trait. There are numerous precedents 
psychology. For example, intelligence, which ] 
consist of several independent components, is 


ndependent components 
aningful to talk about а 
for this in the history of 
has been demonstrated to 
still treated as a meaning- 
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ful general trait. In investigations of intelligence, even those studies indi- 
cating a G factor have typically also shown several independent group 
factors. 

One possible criticism of the above research would relate to the 


TABLE XIV 
CORRELATIONS BETWEEN TRAIT RATING COMPONENTS AND OTHER JUDGING MEASURES 


(N = 50) 
Trait 
Differ- Stereo- Differ- Rating 
ential type ential Total 
Measures Elevation Elevation Accuracy Accuracy — Scorea 
Elevation Ex 
Differential Elevation 12 т 
Stereotype Accuracy 01 = 
Differential Accuracy :29. 16 — 
Trait Rating Total Scores 24 16 25 um 
Behavior Postdiction —23 66 01 30 
Sentence Completion :08 60 25 48 
Opinion Prediction 00) 59 28 52 
Adjective Check 33 44 56 65 
Total Judging (All 
Instrument) 16 10 Л76 53 63 
M 1015 1162 17704 1.4392 24273 
с 1154 0589 3911 5759 8139 
r 05 = 27 r 01 = 35 


@ The correlations between the various components and the Trait Rating Total 
Score involve the total score with the contribution of the particular component under 
consideration eliminated. In computing the correlations between these component 
Scores and the (all instrument) Total Judging score, however, it proved to be impossible 
to eliminate the contribution of the particular component under consideration, since 
the components represent the relative contribution of these components to the variance 


accuracy, 
TABLE XV 
RELIABILITIES OF JUDGING INSTRUMENTS 

5 Films 10 Films 

vs. 5 films vs. 10 films 6 Films 

N=50 N = 20 only, N = 1490 
Trait Rating 42 81 60 
Behavior Postdiction 66 37 
Sentence Completion 67 — 
Opinion Prediction 67 59 
Adjective Check 79 61 
Total Judging т 54 


? Reliabilities for Behavior Postdiction, Opinion Prediction, and Adjective Check 
ате computed by Kuder Richardson Formula 21. Trait Rating Reliability is a Spearman— 
Brown Coefficient based on the first three films—second three films split. Total Judging 
reliability was estimated using the formula presented by Mosier (1943). 
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possibility that the generality obtained was due to real similarity between 
persons judged. This view would suggest that the selection of the films 
on the basis of their discrimination of good judges from poor, as was 
done in this research, would result in a selection of films in which the 
"others" or interviewers were more similar to one another th 
to the "others" in the eliminated films. 

In order to check this possibility, a speci 
Trait Rating, 


an they were 


al key was prepared for the 
based on the mean rating on each trait of the criterion 
scores of the 10 films included in the study. Then an "Index of Similarity" 
was computed for each of the 10 best films included and for each of the 
10 films eliminated on the basis of the earlier pilot study by scoring the 
criterion ratings for each film against this special key. The two groups of 
films were then compared with regard to this Index. The results of this 
comparison (t — 1.694, df — 18) fail to support a significant difference 
between the groups of films in this respect. The small "chance" difference 
that does occur, however, is in the expected direction. Generality of 
judging ability to the extent found in this study cannot, therefore, be 


explained on the basis of real similarity to one another of the persons 
judged. 


D. RELATIONSHIP BETWEEN THE SCORING SYSTEM AND COMPONENTS 


In Cline and Richard's early research it appeared that Cronbach's 
Stereotype Accuracy and Differential Accuracy components were going 
to provide a way out of the methodological morass in which most research 


had bogged down. However, additional work with Stereotype Accuracy 
and Differential Elevation demonstrated empirically that differences in 
the scoring system could spuriously incre; 


them, making Differential Accuracy diffic 
and generating misleading conclusions, TH 
"Interpersonal Accuracy" w. 
the following manner. 


ase, reduce, or even eliminate 
ult or impossible to interpret 
herefore a new measure termed 
as developed (Richards and Cline, 1963) in 


the components problem led to the 
nventory which required the judge to 
Likert-type items dealing with religious 
ing is a sample item: 


predict "other's" responses to 12 
beliefs and values. The follow 


When in doubt, I h 


ave found it best to stop and ask God for guid- 
ance. 


- Strongly agree 

Agree 

- Neither agree nor disagree 
- Disagree 

- Strongly disagree 


MUO > 
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In the filmed interview the "other" had been asked direct questions about 
his attitudes toward religion. 

Subsequent research with this instrument indicated that the scoring 
system had a differential effect upon the components involved in a 
Cronbach-type analysis. With items of the type included in the Belief- 
Values Inventory, there are two possible scoring methods. The first of 
these is to score "Strongly Agree" as 1, "Agree" as 2, etc., without regard 
to whether or not on that particular item, "Strongly Agree" is a pro- 
religious answer. The second possible scoring system is to score the most 
conventional proreligious response as 1, regardless of whether that 
answer is "Strongly Agree" or "Strongly Disagree." If the first of these 
scoring systems is used, the Stereotype Accuracy variance is large, but the 
Differential Elevation variance is made artificially small. On the other 
hand, if the second of these scoring systems is used, the Stereotype 
Accuracy variance is artificially reduced, while the Differential Elevation 
variance is maximized. These effects are illustrated by Table XVI, which 
presents the responses of three hypothetical persons to items of this type, 
with each item score presented in both of the two scoring systems. In 
"Table XVI, the first hypothetical person always answered with the most 
conventional religious answer; the second hypothetical person always 
answered with the second most conventional religious answer, and the 
s gave the middle or neutral response. 


third hypothetical person alwa 
In this table the consistency of responding by each person is, of course, 
somewhat exaggerated to make the point clear. 

In addition to the effects of the scoring system on the variance of 
Stereotype Accuracy and Differential Elevation, several other things are 
apparent from Table XVI. If these three hypothetical persons were used 
as "others" in the films and the first scoring system were used (i.e., where 
"Strongly Agree" is always scored as 1, regardless of religious direction), 
no matter what degree of accuracy a judge attained in predicting their 
responses, the Differential Elevation correlation component could take 
no other value than .00. The Differential Elevation correlation term 
could have no relationship to other measures of judgment and might 
compel the erroneous conclusion that there is no generality of judging 
ability. Use of the second scoring system (i.e., where the most proreligious 
response is always scored 1 regardless of whether it is "Strongly Agree" 
or "Strongly Disagree") has a similar effect on the Stereotype Accuracy 
correlation and again could lead to an invalid conclusion that there is no 
generality of judging ability. It will also be seen from Table XVI that, 
if the first scoring system were used in a study of judging ability, Dif- 
ferential Elevation and also Elevation would reflect primarily the extent 
to which judges interpreted items in the same way as the "others," but 
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that, if the second scoring system were used, Differential Elevation would 
be a measure of the judges’ "sensitivity to individual differences" in 
overall religiosity, and Elevation would be a measure of the judged 
average religiosity of the group of "others." Thus, the apparent paradox 
in Cronbach's formulation is resolved. All of this, taken together, strongly 
suggests that in investigations of accuracy of person perception, and 
particularly of its generality, neither of these scoring systems is by itself 
satisfactory, but rather that the first scoring system should be used in 
computing Stereotype Accuracy and its components and the second 
scoring system should be used in computing Differential Elevation and 
its components. It should be emphasized that, in the hypothetical ex- 
ample, the items differed greatly in their over-all degree of religiosity. 
In studies of accuracy of interpersonal perception, both of these would 
be important and yet either one or the other would inevitably be 
artificially eliminated if either scoring system alone were used. 

These two scoring systems also have another effect which is not 
readily apparent in the hypothetical example since it eliminates all the 
persons-by-items interaction that would occur in a real problem. This 
effect is that the Differential Accuracy component and its correlation 
and variance constituents will all take on different values depending on 
which scoring system is used, and there is apparently no criterion which 
would indicate in a real problem which of these values are the most 
appropriate measures of judging ability. This indicated that under 
certain circumstances none of the values of Differential Accuracy and its 
constituents are particularly good measures of judging ability. Thus it 
was replaced with the new index of judging ability called Interpersonal 
Accuracy. 

"There is no difference score form of this measure; it consists only of 
a correlation term and a variance term. The correlation term is com- 
puted by determining the correlation between each judge's predicted 
values and the corresponding actual responses by "others" on individual 
across items (without converting these scores in 


items and then averaging 
1 person means as is the case with 


terms of their discrepancy from item anc 
Differential Accuracy). Similarly, the Interpersonal Accuracy variance 
term involves the computation of the variance of each judge's predictions 
on individual items, averaged across items. In terms of Cronbach's 
scheme, it appears to be a linear combination of Differential Elevation 
and Differential Accuracy. It offers the strong advantage over other 
measures that it is invariant under changes of scoring system. 

There are several additional considerations in the interpretation of 
the hypothetical example mentioned above. The first of these is th? 
there is a strong general "religiosity" factor underlying the questic 


ho 
or 
a 
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In Cronbach’s scheme, this general factor should be tapped by the 
Differential Elevation component, and this does occur under the scoring 
system in which the conventional religious answer is always scored 1. It 
might be objected, therefore, that the scoring system in which “Strongly 
Agree” is always scored 1 "randomizes" this general factor. This effect 
of this scoring system is, however, exactly the point of the hypothetical 
example; and it should be emphasized again that the scoring system in 
which “Strongly Agree” is always scored 1 is the only scoring system which 
permits the real differences in the average degree of endorsement of the 
items (Stereotype Accuracy) to appear. It should also be noted that the 
nature of the questions presented in the | 
intentionally made such as to emphasize 
scoring system for the Differential Elev 
usual procedure in studies of person per 
ous groups of personality traits as items, and, as Cronbach points out, 
the assignment of scores to such items is frequently arbitrary. As a result, 
if such items are used, the scoring system will still have the effects out- 
lined above, but in such complex and confusing patterns as to make the 
various component scores almost wholly uninterpretable (with the pos- 
sible exception of the Total Score). The fact that there is a strong general 
factor in the hypothetical example would contribute greatly to the clarity 
of interpretation. of the Interpersonal Accuracy component, and the 
author is in agreement with Cronbach's position that several factorially 


pure sets of items analyzed Separately are preferable to one factorially 
complex set of items treated in a global fashion, 

If the argument is 
whether the considerati 


hypothetical example was 
the inappropriateness of this 
ation component. It is a more 
ception to use rather heterogene- 


In this research 46 
female, at the University of Utah, pre- 
ndard “others,” presented through the 
re, on the Belief-Values Inventory. Details of 

ing these filmed interviews are presented 
1960). The Predictions of these judges 
erion, once with each of the two scoring 
various judgment scores intercorrelated. 
rrelation terms are expressed in terms 


these analyses are presented in Table 
XVII. In this table, correlations above the di 


“Strongly Agree” was always scored ] rega 


elsewhere (Cline and Richards, 
were scored twice against the crit 


elations below the diagonal 
al proreligious answer was 
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always scored 1. On the basis of either of these two groups of correlations 
alone, one would have to conclude that there is no consistent pattern of 
generality in judging ability, particularly as reflected in the three correla- 
tion measures, but rather that there appear to be two relatively independ- 
ent factors measured respectively by the Differential Elevation correlation 
term and the Stereotype Accuracy correlation term, thus confirming the 
previous results of Bronfenbrenner and his associates (1958) and Cline 
and Richards (1960). 

A further analysis of these data was made, however, in which judg- 
ment scores were intercorrelated across scoring systems in such a way that 
each component was scored most appropriately. More specifically, 
Stereotype Accuracy and its correlation and variance terms were com- 
puted using the scoring system where "Strongly Agree" was always 
scored 1, and Differential Elevation and its components and Elevation 
were computed using the scoring system where the most proreligious 
answer was always scored l. Results are presented in Table XVIII. In 
this table, there is a consistent pattern of a significant degree of generality 
across the correlation terms of all components, thus suggesting that 
judging ability is, to some degree, a general trait. 

These results indicate that the scoring system may be an important 
artifact in investigations of the generality question when components of 
accuracy scores are used, and therefore Strongly support the argument 
advanced in the hypothetical example discussed earlier, though it is 
still most important to avoid overgeneralization from these results. 


E. EFFECT or VARIED STIMULUS INFORMATION ON JUDGING-ACCURACY 

SCORES 

The next focus was on the fate of the v 

the amount of information 
systematically increased. 

Using the Belief-Values Inventory ( 

standard set of six filmed interviews, one group of 95 judges made their 

judgments on the basis of seeing and hearing the filmed interviews and 


were termed the full information group. A second group consisted of 50 
persons who filled out the Belief-Values Inventory without seeing the 
films at all, only knowing the age, sex, marital status, and number of 
children of the six stimulus persons. The third group consisted of 58 
judges who filled out the same inventory only twice, once as they thought 
it would apply to a typical American adult male and once for 
adult American female. Thus, in these ] 


arious components when 
about the stimulus person or “other” is 


judging instrument) and the 


a typical 
ast two groups the judges were 
required to make judgments solely on the basis of stereotypes, though 
these stereotypes differed in the amount of information on which they 
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were based. The results (see Richards, Cline, and Rardin, 1962) for the 
Interpersonal Accuracy z and Total Error score indicated that the judges 
in the "full information" group were significantly superior both at rank 
ordering the stimulus persons in terms of conventional religious values 


and at predicting exactly the responses of these same stimulus persons. 


Results for Interpersonal Accuracy variance indicated that differentiation 
among stimulus persons increased as the amount of information provided 
about them increased. Contrary to what would be expected, Stereotype 
Accuracy z varied significantly among the three groups while Stereotype 
Accuracy variance decreased as the amount of information increased. 
This probably indicates that Stereotype Accuracy is a measure of con- 
siderable complexity and contrary to previous opinion probably involves 


some degree of sensitivity to individual persons. Additional research is 
needed on this point. 


F. COMPONENTS or JupcGinc-AccuraAcy SCORES APPLIED TO THE CLINI- 


CAL versus STATISTICAL PREDICTION CONTROVERSY 


A controversy relevant to some of the basic issues in interpersonal 


perception relates to clinical versus statistical prediction, Meehl (1955) 
in surveying all studies extant in 


which the relative predictive accuracies 
of clinical and actuarial techn 


iques were compared concluded that no 
published study indicated a clear superiority for clinical procedures. 


More recently, Holt (1958) has suggested that these conclusions resulted 
from the fact that the studies on which they were based involved kinds of 
predictions to which actuarial procedures are better suited than are 
clinical procedures. Richards (1963) has suggested a conceptual scheme to 
clarify this issue using a modification (Cline and Richards, 1960) of the 
analytic model for scores developed by Cronbach 
(1955). 

Cline and Richards (1962) conducted an empirical study to com- 


pare the clinician and the actuary, using a component analysis of the 
judging accuracy scores, They argued that in inv 

accuracy using the component method of 
and particularly its correl 


person perception 


ation subscores, 
of sensitivity to individual differences, Cronb 


ach states that Stereotype 
Accuracy is not a measure of this type of se 


nsitivity, but rather of the 
oup of persons about whom 
Tcumstances, for people in 
all components are important, 
al prediction is to achieve accuracy on 
to individual differences. On the other 
ut, the accuracy of actuarial prediction 


the predictions are made (and 
general). It may be argued that while 
the central goal of “real life” clinic 
components measuring sensitivity 
hand, as Richards (1963) points o 
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can only be due to the fact that actuarial tables have an accurate 
stereotype. Since this is the case, studies which indicate that actuarial 
prediction is superior to clinical prediction must indicate that actuarial 
tables have a more accurate stereotype than clinicians do; in other words, 
a higher score on the Stereotype Accuracy correlation term. This has been 
demonstrated in research (Halbrower, 1955) cited by Meehl (1956) to 
support his position. 

On those components measuring "sensitivity to individual dif- 
ferences,” however, actuarial prediction is at an inherent disadvantage in 
that it must predict all members of a given class of persons to have 
obtained the same score. On those components, therefore, any accuracy 
achieved by actuarial procedures must be due to differences between 
classes. The clinician has no such restriction on his predictions: he can 
easily predict differences within classes, thus increasing the range and 


thereby his possible accuracy on components measuring sensitivity to 
vious that if he does predict differences 


individual differences. It is ob 
within classes, it may either increase or decrease his accuracy. However, 
the author knows of no reason to suppose that it would always result 
in a decrease, and concludes that in any given case it is an empirical 
question whether accuracy is increased or decreased. 

Much of the above was, of course, recognized explicitly by Meehl in 


his original book, but he could find no evidence that the clinician's 


greater freedom resulted in greater accuracy. Probably two factors account 


for this. The first of these is that no studies of clinical versus statistical 
prediction have made either an explicit or implicit comparison of the 
various components of accuracy scores. This is not a criticism, since no 
Such analytic scheme was available at the time most of these studies were 


conducted. The second major reason for Meehl's findings was, in the 


opinion of Cline and Richards, that many, or most, studies comparing 
Clinical and statistical prediction have used measures which reflect mainly 
Stereotype Accuracy, which would favor the actuarial procedures. This 
is particularly clear in the case of the “Q correlation," which is commonly 
used in studies of clinical prediction. Q correlation is a within-persons 
correlation and is therefore based on differences between items. Dif- 
are defined as Stereotype Accuracy both 


ferences between items, however, 
author's modified model. 


in Cronbach's model and in the 
On the basis of the rationale outlined above, Cline and Richards 


developed two hypotheses for the application of this scheme of analysis 
into component scores to the clinical versus statistical prediction studies. 
These hypotheses were that (1) in all studies using an acceptable actuarial 
prediction procedure, actuarial prediction will be superior to clinical on 


the Stereotype Accuracy correlation component; and (2) in some such 
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studies clinical prediction will be Superior to actuarial on the Inter- 
personal Accuracy correlation subscore. The purpose of the present study 
was to obtain data illustrating the use of component scores in a study of 
clinical versus statistical prediction, and to demonstrate that it is 
possible to obtain results favoring actuarial prediction on many of the 
commonly used measures of accuracy, 
Interpersonal Accuracy. 

It is true, of course, that the 
Interpersonal Accuracy can alw 
accurate di 


but favoring clinical prediction on 


accuracy of actuarial prediction on 
ays be increased by more and more 
ion into classes of persons. This, however, does not invali- 
date the basic argument, since it would still be an open question whether 
or not the accuracy of actuarial prediction on this component could be 
increased with reasonable €conomy to a level equal to or higher than the 
accuracy of clinical prediction. Most actuarial prediction procedures have 
used relatively crude classifications, achieved with considerable effort. 

In this study the Ss were 56 students, male and female, from under- 
graduate psychology classes at the University of Utah. While it is 
hazardous to generalize from such a population of naive untrained 
students to a population of highly trained clinical psychologists, this is a 
conservative error, likely to penalize clinical prediction. If this is the 
case, then any superiority over actuarial prediction found for these judges 
can be generalized to trained clinicians. 

In this research, the “clinic 
seen sound color movies of brie 


al predictions” were made after the Ss had 

f interviews of six different persons, These 
films were selected from a large Broup of 25 films on the basis of proce- 
dures resembling item analysis which are reported in detail elsewhere 
(Cline and Richards, 1960). The six films used presented the interviews of 
three males and three females, and the persons interviewed were selected 
so that they were diverse in age and background. With particular regard 
to the present research, they were selected so that they were heterogeneous 


in their self-reported acceptance of conventional religious beliefs and 
practices. The filmed interviews included direct questions about views 
on religion. 


The task for Ss in this experiment was to fill out one judging 
instrument, the Belief-Values Inventory, as they thought the persons who 
were interviewed in the films had filled it out. The Belief-Values Inven- 
tory consisted, as previously mentioned, of Likert-type items dealing with 
personal values in the religion area. There were 12 
six films, and the same 12 were used for every film. 

Six scores for each 5 were computed on th 
Five of these were Total Error Score 


Accuracy variance, Interpersonal Accu 


items for each of the 


€ basis of this instrument. 
$ Stereotype Accuracy z, Stereotype 
1Гасу z, and Interpersonal Accuracy 
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variance. In the present study, the Interpersonal Accuracy z score 
measured primarily the success with which these six persons were rank 
ordered in terms of their over-all “religiosity.” 

The sixth score calculated on the basis of the Belief-Values Inventory 
is called Q-Z, still another expression of correlation in terms of Fisher's z. 
In determining the value of this score for a given judge, the responses to 
the items predicted by the judge for each of the six persons in the films 
were correlated with corresponding actual responses for the six persons 
individually. In other words, it is a “within persons” correlation, highly 
similar to correlations based on Q sorts. These correlations were then 
converted to Fisher’s z and averaged across the six persons. This score 
was used in this study because it corresponds closely to a common measure 
of accuracy of clinical prediction, and therefore makes possible com- 
parisons between the present research and earlier studies of the clinical 
and statistical prediction problem. 

At the start of each judging session, the Ss filled out forms, dealing 
with necessary background information, after which the film of the first 
interview was shown. The projector was stopped, and the Ss made their 
predictions or judgments about the first film. As soon as all Ss completed 
their judgments, the film of the second interview was shown. This pro- 
cedure was repeated until all six films had been shown and all judgments 
completed. 

In previous studies comparing clinical and statistical prediction a 
variety of methods had been used, What these methods appear to have in 
common is the establishment of norms for some group or set of subgroups 
and then the derivation of predictions about members of an independent 
group. Meehl (1955) has suggested a further requirement, that both 
prediction procedures use exactly the same information, but Estes (1961) 
has pointed out that none of the studies cited by Meehl actually met this 
requirement. This "defect" did not prevent Meehl from deriving im- 
portant and legitimate conclusions, and therefore should not prevent 
other investigators from deriving important and legitimate conclusions 
from similar studies. 


This implies that : 
Prediction in this study would be to establish norms for the Values- 


on the basis of an independent group and use these 
onses of the smaller set of persons appearing 
Accordingly, the following procedure was 


used. The group used in defining the norms consisted of 56 persons who 
were interviewed and filmed at two different periods during the course 
ing research project on interpersonal perception. 
of course, include the six used in the present 


the most appropriate procedure to use for statistical 


Belief Inventory 
norms in predicting the resp 
in these filmed interviews. 


of the author's on-go 
The 56 films did not, 
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research. On the basis of a "total religiosity” score on the Belief-Values 
Inventory, these films were divided into two groups on the basis of a 
median split, one "high" on religiosity (i.e., above the median) and one 
“low.” The “total religiosity” score is defined as the sum of the scores on 
the 12 Belief-Values Inventory items when the scoring is adjusted so that 
a high score on each item represents acceptance of conventional religion. 
Norms were then computed for each of these two groups by averaging 
the responses of persons falling into each group to the 12 Belief-Values 
Inventory items. 


The next step was to compute the “total religiosity” score for the six 
films used in the present research. The re 
ing in the films were then “ 


score would have placed them 
group, and predicting that they 
average responses. Two of the 
е the median, and four fell below 


ant thing that should be noted about this pro- 


of similar studies were conducted, the scores for 


ld be subject to sampling variation. Unfortu- 
estimate what this variation would be and use 
atistical tests of significance of difference between clinical predic- 
tion and statistical prediction. This makes it necessary that the statistical 
test used be the ¢ test for significance of deviation from a point of value 
(McNemar, 1955), with the error estimate involving only variation in 
clinical predictions (Le., variation among judges). It should be em- 
phasized that this is not a conservative error, since it would result in 
rejecting a true null hypothesis too often. Accordingly, it was decided to 


require the .01 level of significance before rejecting the null hypothesis 
rather than the conventional .05 level. All ү tests reported in this study 
are two-tail tests. 


nately, it is impossible to 
it in st 


and the actuarial 


be summarized as 
tion procedure had a more accurate 


made the “clinical” judgments, but 


indicating that the statistical predic 
“stereotype” than did the Ss who 
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that the "clinician" Ss were more successful than the statistical prediction 
procedure in judging differences among persons. The comparison of the 
procedures on the variance components as expected indicates that the 
clinicians differentiated much more among persons than did the actuarial 
procedure. These results, taken together, reveal a complex pattern not 
reflected in the global Total Error Score, and thus strongly support the 
usefulness of component scores analysis in studies of clinical and statistical 
prediction. This suggests that the present research may be generalized to 


other such studies. 
TABLE XIX 
COMPARISON OF CLINICAL AND STATISTICAL PREDICTION ON COMPONENTS OF 
ACCURACY SCORES? 


Clinical Standard Statistical 
prediction error of prediction 
Score mean (N = 56) mean score t p 
Total error 
score 1.22 .0334 61 18.26 .001 
Stereotype 
Accuracy Z 1.09 .0494 9.09 20.24 001 
Stereotype 
Accuracy 
variance ES .0267 .39 2.25 NS 
Interpersonal 
Accuracy Z 89 0227 48 7.05 001 
Interpersonal 
Accuracy 
variance 1.20 .0601 45 12.48 001 
QZ 82 0951 89 2.79 01 


" Note: Owing to the fact that no estimate of the sampling variation in statistical 
Prediction is available, the statistical tests would reject a true null hypothesis too 
an > 05 in the significance tests is required. 


often, Therefore, a p > .01 rather th: 

One purpose of this research was to demonstrate that it is possible 
to obtain results favoring actuarial prediction on many of the commonly 
used measures of accuracy, but also favoring clinical prediction on 
Interpersonal Accuracy, and this goal has clearly been achieved. While 
the results of the present study should not be overgeneralized, they should 
not be undergeneralized either. In particular, this statistical prediction 
Procedure, while limited, is comparable to those used in many, or most, 
previous studies of clinical and statistical prediction. 

In addition, the present research clearly suggests that, contrary to 
common opinion among clinicians, the activity at which clinicians are 
Most likely to exceed the actuary is not making predictions about a 
"unique individual" This does not mean that there are no dramatic 
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instances in which a clinician makes successful predictions about a single 
individual by deviating markedly from actuarial expectancy, but rather 
that if the average clinician's major activity is making predictions within 
individual persons, his average accuracy will at best be equal to and at 
worst considerably less than the accuracy of statistical prediction since 
his stereotype tends to be less accurate. On the other hand, the present 
research does suggest that the most appropriate activity for clinicians is 
predicting differences among persons who are grouped into the same 
class by statistical prediction, and it is easy to suggest many ordinary 
activities of clinicians that are of this type. An example of this might be a 
mental hospital in which 10 patients had roughly the same statistically 
derived "predicted benefit from psychotherapy" scores, but which had 
such a limited staff that only three of these patients could actually be 
given psychotherapy. The results of this study suggest that the clinician 
might be quite successful in picking the three “best bets” although all 


10 had been given an “equal” rating by the Statistical prediction pro- 
cedure. 


С. Comparisons Or Groups v 


ERSUS INDIVIDUALS IN MAKING Jube- 
MENTS ABOUT OTHERS 


A question with a somew 
of people can collectively 
individual—was asked. 


hat different direction—whether a group 
make more accurate judgments than a single 


The rationale of this experiment (Cline and 
Richards, 1961) grew out of the recent survey of studies comparing group 


performance and individual performance made by Lorge, Fox, Davitz, 
and Brenner (1958). The general conclusion of this survey was that @ 
group, on almost any task, will perform better than a typical individual, 
but not necessarily better than a superior individual on the task in 
question. This finding is true whether the “group performance” is made 
by a genuine group or is merely a statistical combination of several 
independent individual performances, An unresolved question is the 


degree to which these findings can be attributed to a reduction in the 
variability of the group perform 
The trend of the studies 


to be tested in this experimen 


ance. 


cited in this Survey suggested the hypothesis 


t. This hypothesis was the following: 
The accuracy of predictions (about the bel 


havior of other persons) 
made by a group of persons arriving at a Consensus prediction through 
group discussion will be significantly Breater than the average accuracy of 
the predictions made by the indiy 


iduals composing the group. The aver- 
age accuracy of the prediction made by the individuals composing the 


group will also be significantly less than the accuracy of an “artificial 
group” (composed of pooled independent judgments for each item) and 


ч 


INTERPERSONAL PERCEPTION 26 


also less than the accuracy of prediction of the best individual among the 
individuals composing the group. 

A secondary question relates to the presence or absence of a con- 
sistent pattern of superiority in accuracy among predictions made by the 
best individual judges, consensus groups, and artificial groups composed 
of pooled independent judgments for each item. 

The subjects were 186 students, both male and female, in the intro- 
ductory psychology clases at the University of Utah. The procedure 
involved the presentation of six filmed interviews or "standard others.” 

In this study, two prediction instruments were used. The first of 
these was the Adjective Check (ACL), which required the subject to 
determine which of a pair of adjectives the interviewee had checked as 
being descriptive of himself, There were 20 such pairs for each of the 
six films, making a total of 120. The score on the ACL was the number 
correct. Thus the ACL is similar to a forced-choice rating procedure. 
The second instrument used was the Belief-Values Inventory (BVI), 
already discussed. 

There were | 
scores were computed from j 
program developed for the II 
total score, which was based on the 


9 ВУ] items for each film or interview. Several different 
udges' responses to this instrument. using a 
3M 650 computer. The first of these was a 
average of the squared discrepancies 
(using the one-to-five point scale) between predicted responses by each 
judge for each interviewee, and actual responses of each interviewee. 
This is an error score and in order to make these scores comparable with 
Other scores used in this study, the scores were converted to accuracy 
scores through a standard score transformation, setting the mean equal 
to 50 and standard deviation equal to 10. 

The second two BVI scores are components of what Cronbach (1955) 
has called "Stereotype Accuracy": (2) correlation between each judge's 
and obtained item means, converted to a Fisher's z, 
ach judge's predicted means. Cronbach has 
the two parameters in Stereotype 
nt, and they permit independ- 
accuracy and on variability of 


predicted item means 
and (b) the variance of е 
demonstrated these two scores to be 
Accuracy when the criterion is held constar 
ent evaluation of the effect of grouping on 
prediction in this study. The last two scores on the BVI are measures ol 
Interpersonal Accuracy. 

The 186 subjects in this exp : 
person groups. The division was made at the time the experiment was 
conducted, and most groups cons мей of three persons seated next to 
each other in the experimental room. Group composition in terms of sex 
of group members was roughly random. The 55 saw each film and then 
completed the judging instruments. independently. Next, they joined 


eriment were divided into 62 three- 
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together in a group discussion fashion and proceeded to arrive ata cone 
sensus judgment for the items on the judging instruments without refer- 
ring back to their earlier independent judgments. 

The "artificial group" judgment (pooled independent judgments for 
each item) was derived from the individual judgments of the group 
members. Thus, on the ACL, the artificial group judgment was deter- 
mined on the basis of a "majority vote" of the judges on each item 
(by inspecting their individual judging protocols). On the BVI, it was 
calculated by determining the average of the values predicted by the 
three judges for each interviewee on each item. It is important to 
emphasize that this artificial group is only a statistical combination of 
the original independent judgments for each item. 

The average accuracy of individu 
obtained by computing the mean of tł 
three individuals who made up e 
artificial group procedure where 


the three group members were av 
Scores. 


als composing the group was 
ne total accuracy scores of the 
ach group. This is not the same as the 
the actual item-by-item predictions of 
eraged rather than their total accuracy 


'The best judge in each group was selected on the basis of his 
accuracy scores. In interpreting the results of this study, therefore, it is 
important to note that this selection was done on an after-the-fact basis, 
thus maximizing accuracy scores for this condition by capitalizing on 
chance. It would, therefore, be impossible for a best judge selected in 
advance to obtain a higher score than this, and such a best judge would, 
in fact, probably score somewhat lower, since some error would be 
involved in any advance selection. The best judges were selected in- 
dependently for the ACL and the BVI and therefore were not necessarily 
the same person on the two different instruments, On the BVI, however, 
the best judges, selected on the basis of total Score, were also used as 


best judges in making the comparisons involving the other scores derived 
from this instrument. 


The mean and Standard deviations for each 


each judgment score are presented in T 
ате accuracy scores. Since tot 


judgment procedure on 
able XX. In Table XX, all scores 


Sec п BVI is based on error score, in 
Table XX this judgment score 15 transformed to a standard score distribu- 


tion with mean — 50 and standard deviation — 10. 

Аз a first step in the statistical analysis of these data, over-all F tests 
were calculated for each of the judgment scores Separately. The results of 
this analysis are presented in Table XXI. No test for homogeneity of 
variance was made before calculation of these F tests. This procedure was 
followed because the recent work of Boneau 


URGE (1960) strongly suggests that 
F is not significantly affected by heterogeneity of variance if the sample 


al score o 
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TABLE XX 
MEANS AND STANDARD DEVIATIONS OF JUDGMENT SCORES 


Artificial group 


Average of Three- derived by 
individuals person pooling three 
composing Best group independent 
the group judge consensus judgments 
Adjective Check List 
Е: 97.27 101.66 102.52 103.32 
ы 3.51 3.91 3.95 4.55 
Belief-Values Inventory 
Total 
x 43.29 53.92 4947 52.87 
LÀ 8.61 8.31 11.16 8.02 
Stereotype Accuracy z 
X 1.19 1.44 1.28 141 
ы .28 37 45 57 
Stereotype Accuracy 
variance 
X 35 40 31 80 
i 214 22 15 43 
Interpersonal Accuracy z 
x 90 1.01 1.00 98 
= 12 14 16 15 
Interpersonal Accuracy 
variance 
х 1.09 1.06 1.06 91 
= 93 28 31 25 
TABLE XXI 
RESULTS OF OVER-ALL F TESTS FOR JUDGMENT SCORES 
Between- Withi 
variance variance 
Judgment score (df = 3) (df = 244) F 
Adjective Check. List 
Total 451.82 13.82 32.62a 
Belief-Values Inventory 
Inventory total 1423.02 84.33 16.872 
Stereotype Accuracy z .8633 .1432 6.03« 
Stereotype Accuracy variance 1333 0282 4,720 
Interpersonal Accuracy z .1633 :0213 7.67a 
Interpersonal Accuracy variance .3900 0754 5.175 
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sizes are identical and relatively large, i.e., 20. Both of these conditions 
hold in the present study. It is also known that available tests for homo- 
geneity of variance are affected too much by other variables than that 
involved in the null hypothesis to justify their use prior to an analysis of 
variance (Box, 1953). 

Since all of the F tests in Table XXI are significant at or beyond 
the .01 level of confidence, a test for significance of difference between 
individual means was made. This test was made using the multiple range 
test (Li, 1957, p. 238), which is an appropriate procedure for making 
“post-mortem” type comparisons between individual means after an 
over-all F test has been made. Briefly, the multiple range test involves 
computing a value which represents how large the difference between two 
means must be in order to be significant at a stated level, and then com- 
paring the obtained difference to this value. Results of this analysis are 
summarized in Table XXII. 

On each of the four accuracy measures, the best judge and both group 
judgments are significantly superior to the average of the individuals 
composing the group. Thus the major hypothesis of this experiment was 
confirmed. There is no consistent pattern of significant differences among 
the first three procedures mentioned above. As would be expected, on the 
two scores representing the amount of variability in predictions, the 
artificial group mean tends to be lower than the means of the other three 
procedures. This tendency is significant, however, only for the Inter- 
personal Accuracy variance score. It is somewhat surprising to find that 
the artificial group (or pooled independent judgments of items) is 
superior to the best judge on the ACL. The interpretation of this finding 
seems to be that if both other judges disagree with the best judge, they 
are more likely to be right than is the best judge. If, on the other hand, 
only one of the other judges disagrees with the best judge, he is more 
likely to be wrong than is the best judge. 

This study clearly implies that satisfactory ratings are least likely to 
be obtained from a single unselected individual. In exploring further 
implications of these results for an operational rating setup, several other 
considerations enter. The first of these is that typically the best judge 
Would be difficult to select on ап а priori basis, and (because of selection 
error) best judges selected a priori would probably score lower than the 
best judges used in this study. Since each of the group procedures pro- 
duces results roughly equivalent to those for the best judge selected on 
an after-the-fact basis, an extensive (and expensive) effort to identify best 
judges and use them as raters would appear to be unnecessary. 

The second consideration involved in applying these results is that 


by far the most time in this experiment was consumed in arriving at 
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consensus judgments through group discussion, a finding which one would 
certainly expect to generalize to other situations. Since the artificial group 
(or pooled independent judgments of items) procedure produced results 
as good as or better than the results produced by the consensus judgment, 
and required much less time, it would appear to be most appropriate 
when accuracy and time are both considered. Thus, the best procedure for 
using ratings in many applied situations would be to obtain several 
independent ratings from different raters for each ratee, and then combine 
these ratings statistically into a single rating. It should be noted, however, 
that the superiority of the artificial group in terms of time required 
(and therefore expense) might disappear if only a single summary rating 
were required rather than the many relatively specific judgments re- 
quired by the experimental procedure used in this study. н 

A limitation to these conclusions is the fact that each rater in this 
experiment was basing his ratings on the same or identical information 
(1.е., seeing the same movies of the interviews). If different raters are 
basing their ratings on different information, some other procedure 
involving the sharing of this information might be superior. 

In addition to the practical implications outlined above, these 
results present at least two more additions to previous psychological 
research. The first of these is the demonstration through both the 
Stereotype Accuracy correlation term and the Interpersonal Accuracy 
correlation term of the BVI that accuracy is increased through grouping 
independent of a reduction in variability (see Table XX). Unlike the 
other results of this experiment, this would not necessarily be expected 
on the basis of Previous studies comparing §roup and individual per- 
formance, although it certainly is consistent with previous studies. The 
second major addition is related to the current Controversy in the inter- 
personal perception literature over the relative merits of various different 
types of accuracy scores (Cronbach, 1955). In the current study the total 
Score on the ACL, the total score on the BVI, and the Stereotype 
Accuracy and Interpersonal Accuracy correlation terms all gave con- 
sistent results and, more important, results w 


; hich make sense in terms of 
previous research comparing group anq indiv 


idual performance. 
H. A FACTOR ANALYTIC STUDY ор THE INTERPERSONAL ACCURACY 

COMPONENT 

The Interpersonal Accuracy judging score component has been 
discussed previously and used in a variety of settings. As already men- 
tioned the correlation term of this component is regarded as most closely 
related to what is ordinarily meant by accuracy of interpersonal percep- 
tion. This term is computed by determining the correlation between 
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predicted, or judged, scores on individual items, and then averaging 
across items after transforming the item correlations to Fisher's z values. 
Where a strong general factor underlies the items, such as religiosity, this 
component measures the extent to which the persons judged are rank 
ordered accurately on that factor. Review of the results of Cline and 
Richard’s various studies of this component has proven them complex 
and at times confusing. While it has responded at the group differences 
level to experimental manipulation in meaningful ways, it has tended to 
show little generality at the individual differences level across different 
measures (i.e., different sets of items or persons judged). 

One possible interpretation of this pattern of results is that this 
component has validity but not reliability. Such a paradoxical result 
would be possible if the component were factorially complex, and this 
possibility suggested the hypothesis for the present study. This hypothesis 
is that when the individual item z's are treated as scores, intercorrelated, 
and rotated to an analytic solution, there will be several group factors 
rather than a strong general factor. 

In order to determine whether this might be the case, the following 
study was undertaken in which 129 University undergraduate students 
participated as Ss. They were shown the standard six filmed interviews 
and given the judging tests (for each film), each of which was susceptible 
to an analysis in terms of components. The titles of the judging instru- 
ments and the characteristics underlying their item content are given 
below: 

(1) Personal Practice Questionnaire (participation in religious 
activities), „ри 

(2) Activities Preference Inventory (femininity of interests), 

(3) Interpersonal Relations Inventory (socialization). 

All of the items from these scales were put into a Likert format. On 
the Activities Preference Inventory and the Interpersonal Relations 
Inventory, five responses were provided for each item, ranging trom 
"Strongly Agree" to “Strongly Disagree." Scores from 1 to 5 were assigned 
to these responses so that "Strongly Disagree" always received a score of 5. 
On the Personal Practices Questionnaire four responses were provided for 
Sach item varying from "Frequently" to "Never." Scores from l to 4 
Were assigned to these responses so that "Never" always received a score 
of 4. There were ten Likert items per measure. 

For each $, his scores on a given instrument were his 10 individual 
item Interpersonal Accuracy correlation values. In other words, on each 
item his score was the Z conversion of the correlation between his pre- 
dicted scores for the six stimulus persons on that item and their cor- 
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responding "true" scores. It was thus a measure of his success in rank 
ordering the stimulus persons on that item. | 

"These scores were intercorrelated and factor analyzed for each instru- 

ment separately using a principal components solution and varimax 

orthogonal rotation. Unity was placed in the diagonal and all components 
TABLE XXIII 

ROTATED Loapincs FOR FACTOR ANALYSIS OF INDIVIDUAL Ii 

PERSONAL PRACTICES QUESTION 


EM ACCURACY SCORES ON THE 
AIREG 


Factors 
" n2 
Item A B [e 
l. Within the past few years I. have attended " 
religious services. 80 —.96 —.04 71 
2. I have taught “Sunday School” or given 
another religious class or group within the 
last two years, :00 55 18 3 
3. Within the last two years I have invited 
Someone not of my faith to attend religious 
services with me. 69 —05 18 5l 
4, Within the last few years I have discussed 
religious topics with my friends. 34 — 33 21 21 
5. Within the past two years I have prayed. 22 79 —04 pF 
6. In my family we have the Practice of having 
family prayer. 53 05 32 39 
7. Within the Past two years I have spent 
periods of time in private religious thought 
and meditation, —295 65 =й0 50 
8. Within the past two years I have read the 
Holy Scriptures, bo uq; 16 63 
9. Within the past two years I have read books, 
magazine articles, etc, with religious themes 
or "inspirational messages." 04 93 79 68 
10. Within the past two years I have Biven 
money (or donations) to a church ог 
religious group. 64 20 45 


* Note. For 
"Never. 


h item four choices were 


provided ranging from "Frequently" to 


with an eigenvalue Breater tha 
detail the rationale for this fac 

The rotated factor lo 
Practices Questionnaire 


n 1.00 extracted. Kaiser (1960) presents in 


able XXIII, for the Activities 
and for the Interpersonal Relations 
Inventory in Table XXV. 


The implications of these results are clear, Interpersonal Accuracy 
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is a complex measure, involving several orthogonal factors rather than a 
large general factor and the factorial complexity is a much more im- 
portant finding than the details of the individual factors. This indicates 
that accuracy probably should not be interpreted as a single unidimen- 
sional trait, but rather as a group of relatively independent traits. It 
should be emphasized that even though this is true one can still talk 


TABLE XXIV 
ROTATED LoApINGs FoR FACTOR. ANALYSIS OF INDIVIDUAL IrEM Accuracy SCORES ON THE 
ACTIVITIES PREFERENCE INVENTORY® 


Factors 
Item A B C D h2 
l. I would like the work of a clerk 
in a large department store. 54 —37 —23 —16 51 
2. I like adventure storics better than 
romantic stories. —.ll 58 24 29 49 
3. I would like to be a soldier or a 
WAC. —.09 08 —.56 05 33 
4. I very much like hunting. —.02 03 —.19 —76 62 
5. 1 think I would like the work of 
aaa. —04 —03 —22 67 50 
6. I think I would like to be a pro- 
fessional golfer. —12 —10 66 04 46 
7. I think I would like the work of 
a professional musician. —74 42 —.12 09 75 
8. I think I could do better than 
most of the present politicians if 
I were i OIE 42 —68 07 23 59 
9. I get excited very easily. 04 45 54 —04 48 
10. The rage person is not able to 
—Л75 RU —.06 —.14 61 


appreciate art and music very well. 
provided for each item varying from “Strongly Agree” 


% Note: Five choices were 
to "Strongly Disagree.” 
meaningfully about differences in over-all ability to judge others accu- 
rately. In order to do this, however, one must interpret variations in 
over-all accuracy as variations in the elevation of a profile (Cronbach and 
Gleser, 1953). This is a very different conception from variation along a 
single dimension, and one with important implications for future 
Tesearch on accuracy of interpersonal perception. At the very least, it 
implies that one cannot expect to adequately describe a person's accuracy 
of judging others with a single score. For example, if variations in over-all 
accuracy are conceived of as variations in profile elevations, one must also 
consider other characteristics of profiles such as scatter and shape. 
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Finally it should be noted that the scores for each item in the factor 

analysis were accuracy measures. These factors are not necessarily the 

i zi T s 

same as the factors that would be obtained by analyzing the ещ 

of the persons judged to these items. In other words, the dimensions o 
TABLE XXV 


ALYSIS OF INDIVIDU. 
INTERPERSONAL RELATIONS I 


А Y г: $ THE 
ROTATED Loapincs ғов FACTOR Ax AL ITEM Accuracy SCORES ON ТІ 


NVENTORY 


Factors 
Item А B C D he 
1. When I get bored I like to stir up 
some excitement, 09 06  —80  —1 б 
2 


2. I would do almost anything on a 


dare. 62 — —4 —31 24 56 
3. I think I am stricter about right 

and wrong than most people. -— MET 38 —19 54 
4. It is all right to get around the d 

law if you don't actually break it. —.6 19 30 60 8 
5. I often act оп the spur of the 

moment without stopping to think, 19 —3] —31 69 65 
6. I have often Bone against my 

parents' wishes, 71 — —12 8 06 “ 
7. My home life was always happy. —.05 56 —.08 15 M 
8. Before I do Something, 1 try to 

consider how my friends will react 

to it, J7 — 09 йй 45 
9. I seem to do things I regret more 

often than most people. —27 —.28 —.02 —48 38 
10. Every family ow 


es it to the city 
ewalks cleared in 
heir lawns mowed 


to keep their sid 
the winter and th 
in the summer, 


? Note: Five choices were 
trongly Disagree,” 


Provided for 


pach item у, 


arying from “Strongly Agree 


accuracy of judged religiosity are not necessarily the same as the dimen- 
sions of religiosity itself, 


1. EFFECT or SociaL Dest 


RABILITY Res 
ACCURACY 


PONSE SET ох JUDGING 


A study was conducted 
on the effects of different kj 


three motive Strengths or in 
(a) Low incentives: 
reward was provided. (b) 


by Cline and Richards (1964 Annual Report) 
nds of motivation on judging accuracy where 
centive conditions Were used with 65 Ss. 

The Ss were urged to do their best but no 
Medium incentive: Subjects were told that the 


N 
- 
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most accurate judge would receive a S15 prize to be mailed out in several 
weeks. (c) High incentive: Here the subjects were told that the best 
judge would receive a prize of $25; the second best, $15; the third, fourth, 
and fifth, $10; and the sixth through tenth best judges would each receive 
$5. They were told that they would receive these prizes a few minutes after 
the experiment was completed and the money was displayed in full 
view at the front of the room while the experiment was taking place. 
When the results yielded negligible differences in judging accuracy 
between the experimental groups, the researchers speculated that this 
might be accounted for by some uncontrolled response set affecting the 
judging instruments. However, in the construction of these instruments 
attempts had been made to control for the two most obvious response 
sets, namely always choosing “True” (or "False"), and always choosing 
the socially desirable or (undesirable) response. However, no evidence 
had been available to indicate whether this had been successfully ac- 
complished or not. That this should be checked is suggested by the ex- 


perience of Edwards with his Personal Preference Schedule: he thought 


that he had controlled for social desirability, but in fact had not (Corah 


et al., 1958). 

Accordingly, special keys were developed to give evidence about these 
matters. 

The first step in constructing these keys was to obtain ratings of the 
Socially desirable answer to each item of each judging instrument. These 
ratings were made by four persons, one Ph.D. psychologist and three 
advanced graduate students in psychology. The criterion for determining 
the socially desirable answer to an item was that at least three out of the 
four raters agree that the answer in question was the socially desirable 
One. If there was a two-two split among the raters as to the socially 
desirable answer, it was concluded that social desirability had been con- 


trolled on that item. 
Keys were then cons 


all possible combinations of correct respons 
sponse. In other words, for the Behavior Postdiction and the Opinion 


Prediction tests, the keys were: (а) socially desirable answer true, cor- 
Tect answer true; (b) socially desirable answer true, correct answer false; 
(с) socially desirable answer false, correct answer true; and (d) socially 
desirable answer false, correct answer false. For the Adjective Check, the 
keys were (a) socially desirable alternative correct, and (b) socially desir- 
able alternative wrong. On each of these keys, the score is the total num- 
ber correct, Another key for each of the three instruments consisted of 


those items where social desirability was controlled (regardless of whether 
“true” or "false"). Again the score was the total 


tructed to make it possible to analyze separately 
and socially desirable re- 


the correct answers were 


278 VICTOR B. CLINE 


number correct. Finally, another special key, the Popular Responses key, 
was constructed on the basis of the sample reported in the author's 1962 
Annual Report (Cline, Richards, and Abe, 1962), who took this same 
judging test. A "popular response" was defined as an answer chosen by 
80% of the sample regardless of whether that response was correct or not. 


TABLE XXVI 
CORRELATIONS AMONG VARIOUS KEYS WITHIN JUDGING INSTRUMENTS 


Instrument Key 1 2 3 4 


Behavior l. Socially desirable answer True, 
Postdiction correct answer True 
Socially desirable answer True, 
correct answer False —94 
3. Socially desirable answer False, 
correct answer True 27 27 
4. Socially desirable answer False, 
Correct answer False 39 —35 —08 9 


ю 


Opinion l. Socially desirable answer True, 
Prediction correct answer True 

Socially desirable answer Truc, 
Correct answer False —38 
3. Socially desirable answer False, 

correct answer True —12 21 
4. Socially desirable answer False, 

correct answer False 13 


ю 


—58 01 vx 
Adjective l. Socially desirable answer Correct 
Check 2. Socially desirable answer Wrong —39 


TABLE XXVII 


CORRELATIONS AMONG TOTAL Correct Krys FOR THE THREE JUDGING INSTRUMENTS 
1 2 3 
l. Behavior Postdiction = 
2. Opinion Prediction —03 - 
3. Adjective Check 14 00. -ë 


The score on this key was the total number of popul 


; ar responses chosen. 
The hypothesis emerged that the Ss in maki 


ned by computing correla- 
f social desirability were а 
the correlation between keys where 
same and the correct answer dif- 
le, there would be a negative cor- 
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relation between the "socially desirable answer True, correct answer 
True” key and the “socially desirable answer True, correct answer False" 
key. If accuracy were more important than social desirability, these cor- 
relations would be positive. 

The correlations among the various keys within instruments are 
presented in Table XXVI. In order to permit a further comparison of 
social desirability with accuracy, correlations were also computed among 
the Total Correct keys for the three judging instruments. These correla- 
tions are presented in Table XXVII. The results indicate quite clearly 
that variation with respect to tendency to choose the socially desirable 
response is more important in these judging instruments than variation 
in accuracy—at least for this study. This is true in spite of the fact that, 
as previously indicated, efforts had been made to control this response set. 
And while this is just one study, it does demonstrate the problems one 
encounters in attempting to develop judging instruments tailored to 
individual “others.” 

While it cannot be proved, the "social desirability” variable might 
also account, in part, for some of the generality found across judging 
instruments and “others” reported earlier in this paper. 

One possible solution to this problem would be to use a paired 


Comparison method. Use of this technique with filmed interviews 
at Ss would be shown separate movies 


which of these two 


Stimulus material would mean th 
Of interviews of, say, two persons and then asked 
more intelligent, dominant, neurotic, etc. 'This would 


also make sense in terms of the findings of clinical versus statistical pre- 
dictions study where the clinician-judge can best beat the actuary by 
ranking, ordering, or comparing several others at one time. 


Persons were, say, 


V. Summary 


In the middle 1950's Cronbach, Gage, and several other researchers 
Criticized naive empiricism in interpersonal perception research. They 
Pointed out serious weaknesses and flaws in most published studies as 
Well as in traditional methodology common to much research in this 

lown" global judging 


area. The possibility of fractionating or "breaking d i 
a possible solution or way out of this problem. 


Scores was suggested as . 
"pure" accuracy scores un- 


This would allow experimenters to obtain i 
Contaminated by other components ofttimes working au cross purposes 
and in contradictory ways when combined into a global index or measure. 
In a series of researches by Cline and Richards, the power and utility of 
this component approach was demonstrated and helped clarify some of 
the issues in the clinical versus statistical prediction controversy, prob- 
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lems with the generality of empathic ability, the judging accuracy e 
groups versus individuals, and so forth. Despite this, these components 
still revealed some weaknesses and under certain conditions produced 
conflicting and contradictory results. It was demonstrated that the nature 


of the scoring system could affect these components in undesirable ways 
and in certain situations lead to unjusti 


à new Interpersonal Accuracy 
which was believed free of the 
Differential Accuracy and othe 
IA component revealed that (а 


fied conclusions. Because of this, 
(14) measure (component) was developed 
methodological shortcomings of the older 
T components, Factor studies of this new 
ѕ а "pure" measure of judging ability) it 
was factorially complex and not unidimensional. It involved several 
orthogonal factors rather than a large general factor. However, eves 
though this was true, it was believed that one could still talk meaning- 
fully about differences in overall ability to judge others accurately. In 
order to do this, however, one must interpret variations in over-all judg- 
ing accuracy as variations in elevation of à profile where scatter and shape 
must be considered. 

It was also demonstrated that the social 
extremely powerful influence in the use of 
must be considered or reckoned with in the 
as on the criterion development side of the 


The use of stereotypes in judging was found, also, to be an extremely 
important variable in judging. Much evidence suggested that having an 
accurate stereotype of a certain class or subclass of social objects or 


"others" being judged could, apart from any other variable, contribute 
significantly to accuracy, 


desirability factor exerts an 
all judging instruments and 
actual judging process as well 
problem. 


However, it was noted at the date of writing this review that there 


still does not exist a fully Satisfactory and validated test or procedure 
for measuring judging accuracy. And certainly one of the major unsolved 


problems is the lack of a good external criterion with which we might 
validate the new experimental judging tes 


developed. This is a situation not unlike th 
the first tests of intelligence were constructed. 

Because of the many methodological problems encountered where 
single persons or "others" are judged individu 
made that the judging task could be gre 
compare pairs or rank-order triads of "oth 
variables. Thus the only discrimination 
Which of two "others" was the most dor 
might rank-order three "others" 
three films of them or observing 


ts which are occasionally 
at which Binet faced when 


ally, the suggestion Was 
atly simplified by having one 
ers” on a variety of personality 
Tequired would be to indicate 
minant, intelligent, etc. Or one 
on these same traits after first seeing all 
them in interviews etc, This would solve 
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the problem of projection, social desirability, and idiosyncracies of the 
judges in their approach to standard rating instruments. 

Experience has suggested that filmed interviews of “others” have to 
be considered as something akin to unselected items in a test. Some will 
prove confusing, too difficult, or too easy and will have to be discarded. 
And the same is true for the material in the individual judging instru- 
ments. In the author's experience a considerable amount of time and 
energy must be spent in refining and developing these for even a mini- 
mum payoff. 

Since the author's bias is in the direction of studying "ultimate 
achievement" in judging, he would be of the opinion that the "best" 
judging measure ultimately will prove to be a sophisticated global index 
which will combine all the crucial elements contributing to accuracy in 
à measurable and meaningful way. This might involve developing five 
Or six factorially pure tests, or a procedure similar to that used by Guil- 
ford to measure dimensions of intellect, or the much cruder Wechsler 
Adult Intelligence Scale where various subtest scores are combined to 
give a Full Scale IQ, or it might possibly even make use of a composite 
of a profile. 

In any event, the current status of interpersonal perception research 
15 not yet in sharp focus. А fully satisfactory measure of judging accuracy 
still eludes us. 
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I. Introduction: Stimulus Factors in Projective Measures 


А. VARIANCE-ĪNVARIANCE IN PERCEPTION 


, The fact that stimuli facilitate and inhibit behavior is not without 
Prior attention in the field of psychology. Ever since the early beginnings 
ОЁ behaviorism at the turn of this century, it has been generally assumed 
that behavior involves some interaction with stimulus situations. In fact, 
Some psychologists have been so bold as to propose that such interactions 
Constitute the empirical data of psychology. Doubtlessly here lies the 
basis for the protopostulate of many researchers that an intensive analysis 


! On leave of absence from the Department of Psychology, University of British 
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of the role of stimulus functions in behavior will yield findings of con- 
siderable theoretical and practical import for the discipline of psychology. 
Underlying most conceptions of stimulus functions is the broad proposi- 
tion that the stimulus is the entering wedge into a behavior setting. The 
central problem or puzzle for psychology emerges when we attempt to 
find out what happens after an individual is stimulated. | 

A significant finding to emerge from early studies on the interaction 
between stimulation and perceiving was the clear-cut evidence that 
perception is not completely tied to stimulation at the sensory surface. 
Perhaps the classic experimental demonstrations of this lack of invariance 
between sensory input and perception are the phi phenomenon and 
reversible figures. The data on ambiguous figures immediately face us 
with the problem and the possibility that percepts may change without 
any alteration in stimulation, proximal or distal From a historical 


perspective, such phenomena paved the way for the acceptance of the 


belief that input stimulation is only one of the many possible variables 
determining perception and behavior. 


Such data and other interlocking findings show that we do not 


perceive directly either the distal stimulus or the proximal stimulus 
(cf. Brunswik, 1944, 1952, 1955). The former term may simply be defined 
аз the physical object in the objective environment. The energy changes 
impinging on receptors refer to proximal stimulation, and they constitute 
the initial Starting point or occasion for perception. However, if we 
perceive neither the distal object nor the proximal stimuli, then what 
do we perceive? It is at this juncture in the Sequence of perceptual events 
that theory starts to take over. As Woodworth (1958) formulated in his 
characteristically middle-of-the-road fashion 

late that the individual decodes the r 


objective or distal terms and then encodes the consummatory response 
before translating it into peripheral action, Hence, perception is the 
utilization of proximal stimuli as indicators of physical or social objects 
in the objective environment; in short, perception is a translation OT 
decoding of proximal stimulation into distal objects. 

Since the decoding of stimulation is an inferential process, veridical 
perception depends upon assigning the stimulation to the appropriate 
categories which bear correspondence to dista] objects. If the inferential 
process decodes the proximal stimuli into an inappropriate category OY 
code, then perception will be nonveridical, апа behavior may be in- 
appropriate. The two challenging problems in perception are to discover 
the empirical laws as to how proximal stimuli are decoded and how the 
codes are initially developed. To the extent that the formation and 
utilization of codes may possess individual consistencies, then these two 


s x 
1, IL seems necessary to postt 
aw proximal stimulation into 
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problems are of interest to both the perceptual theorist and the person- 
ologist. 


B. STIMULUS FUNCTIONS IN EARLY PROJECTIVE MODELS 


The foregoing oversimplified and sketchy comments on the decoding 
or processing of stimuli lead us to the launching platform of projective 
techniques. They had their origins in empirical findings dealing with 
the relative lack of correspondence between sensory stimulation and 
perception, and in individual consistencies in perceiving. The effective 
application of these findings to the construction of projective techniques, 
however, depended critically upon the early theorizing of Freud and the 
Gestalt psychologists concerning the role of organismic variables in deter- 
mining what was perceived at the sensory surface. 

By and large, the early projective test constructors and interpreters 
were influenced profoundly by the psychoanalytic concepts of primary 
and secondary processes and the role they played in coping and adapting 
to focal and incidental stimulation. Freud’s notions on cathexes and 
hypercathexes of stimuli (cf. Holt, 1962) also provided a point of contact 
between projective test constructors and psychoanalysis. The unstructured 
Stimuli of projective techniques were thought of as devices for reducing 
the reality base of stimulation, thereby permitting and encouraging 
Primary process material to become dominant. The unclear stimuli of 
Projective measures presumably interfere with the process of hyper- 
cathexis. What is equally important, the permissive test instructions and 
the general context of the test situation add further difficulty for the 
material being hypercathected. Or, putting these statements at the other 
-secondary process continuum, it was believed 
by early constructors of projective techniques that ambiguous stimuli, in 
a context of permissive instructions and test situations, weakened or 
‘paired reality-oriented secondary process thought. Veridical percep- 
tions would fail as a consequence of difficulties in the hypercathexis of 
Unstructured stimuli. 

However, the veridicality of perception also had to be dealt with by 
Projective personologists. While Rorschach provided the penetrating 
Insight that the manner of perceiving was а function of intrapsychic 
functions, he did not attempt to develop any systematic theory to 
explicate his perceptual findings. This conceptual task was left to others. 
Unfortunately, as Lindzey (1961) has shown, this assignment is still far 


Tom completed. 
Endeavors to explain 
ave most frequently invoked structura us 
Perceptual determination. Extrapolation of gestalt principle 


end of the primary process 


the stability of perceptions in projective tests 
1 or autochthonous principles of 
s of per- 
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ceptual organization has been primarily limited to the Rorschach test 
(Arnheim, 1951, 1954; Beck, 1933; Belden and Baughman, 1954; Berliner, 
1955; Brosin and Fromm, 1949; Klein and Arnheim, 1953; Rickers- 
Ovsiankina, 1960; Stein, 1951; апа Wertheimer, 1957). Despite the fact 
that many authors have proposed that projective techniques have affinities 
to gestalt principles, the application of these principles to such tests 15 
by no means clear. The main shortcoming of the theorizing relating 
principles of perceptual organization to projective techniques is that 1t 
has been very casual and has not led to much f 


ruitful experimentation. 
The essentials of a gestalt explan 


ation of projective percepts, based 
in large part upon Wertheimer's (1957) discussion of the relations 
between the Rorschach and gestalt laws of organization, starts with the 
notion that a projective stimulus is a multiple reversible figure, with 


the autochthonous factors setting limits on the influence of intrapsychic 
factors. If the autochthonous variables 


sonality parameters will play only 
percepts and cognitions. On the 
structural determinants ( 


are excessively strong, then per- 
à negligible part in the formation of 
other hand, stimuli that are weak an 
ambiguous stimuli) will maximize personality 
processes in perception and cognition. In spite of 
however, the principles of perceptual organizati 


effects on projective responses. The specific consequences would vary 
within a particular projective technique, especially for any test with test 
items varying in degree of stimulus ambiguity. 

Within these two boundary 


ambiguous test material, 
on would still have their 


conditions, the gestalt principles of 
perceptual stabilization have been assumed to Operate as follows: 


l. Law of good continuation, Perceptual structuring of projective 
stimuli will eliminate the irregularities in the stimulus material. For 
example, an irregular round object on the Rorschach may be perceived as 
a bowling ball, just as small parts may be perceptually omitted to provide 
a good gestalt. 

2. Law of similarity, Simil arouse perceptual processes 
leading them to be perceptually united or grouped. It is probable that 
similar forms and similar colors are the two most important similarity 


factors which lead to perceptual Separation of the Rorschach cards into 
major and minor details. 


З. Law of closure. Incom ЛШ cre: perceptual processes 
that will induce complete figures. For example, contour gaps on the 
Rorschach should elicit absolute or partial completion, thereby creating 
unitary percepts. Difficulties j 


x n closure may also appear on the Rorschach, 
e.g., the incomplete closure at the top of Card VII. 
4. Law of proximity. Percepts w 


parts of a projective stimulus that 


ar stimuli will 


; га] 
ill be formed between structur 
are near one another. This principle; 
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in conjunction with that of symmetry, helps to divide the inkblots into 
their major details. The interaction of these two principles is seen most 
clearly at work on Card X of the Rorschach, where whole responses are 
difficult to form. 

5. Law of symmetry. Symmetrical aspects of the projective stimulus 
Will facilitate perceptual grouping. Symmetry of shape undoubtedly 
plays a critical role in form accuracy on the Rorschach. 

While these structural laws of perception are assumed to apply to all 
individuals and to all stimuli, it may be hypothesized that their relative 
influence will vary with the structuredness of the stimulus material. From 
this assumption, it follows that inkblot stimuli weak in autochthonous 
factors would increase the frequency of color-form percepts and vague 
whole responses and decrease form plus percepts and popular responses. 
These predictions rest on the implicit assumption that form dominant 
prägnant percepts reflect autochthonous grouping principles. The testing 
of these predictions would necessitate the construction of inkblots that 
Systematically vary in structure. This formulation bears close affinity to 
the recent analysis of Rorschach protocols, based on Werner’s (1957) 
developmental stage theory of cognition, first applied by Friedman (1953) 
and summarized by Hemmendinger (1953, 1960). 

It is to be noted that the gestalt and psychoanalytic models of 
Stimulus functions have in common the conjecture that ambiguous 
stimuli heighten the influence of intrapersonal variables in perception. 

Given the empirical findings on ambiguous stimuli and the theoreti- 
cal formulations of psychoanalysis, it is not at all surprising that 
Rorschach (1921) and Christiana Morgan and Henry Murray (1935) chose 
Unclear visual stimuli to maximize the variance between the physical 
Stimulus and response output. Almost inevitably, the Rorschach and the 
Thematic Apperception Test (TAT) became viewed as open-end func- 
tions which tapped the major motivational, cognitive, and defensive 
Systems of individuals. They are open-end stimulus functions in the sense 
that any particular ambiguous stimulus is open to multiple perceptions 
and interpretations. Of course, such tests could equally well have been 
called ореп-еп or free-response assessment techniques to the extent that 
the subject has to produce his own responses. . | Е 

From the working assumption that ambiguous situations facilitated 
the operation of drive-oriented thinking, it was further assumed by pro- 
jective theorists that the greater the ambiguity, the greater the personality 
revelation, That is, the somewhat cavalier assumption was made of a 
Monotonic relationship between stimulus ambiguity and personality 
revelation. With respect to fantasy productions, McClelland (1958, p. 31) 
aS put this assumption rather succinctly in his statement that fantasy 
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"capitalizes on the generalization, repeatedly confirmed in studies of the 
effects of motivation on behavior, that the more ambiguous the situation, 
the greater the effect of motivation." This hypothesis is consonant with 
gestalt predictions, for Wertheimer (1957, p. 212) also posits, “With more 
ambiguous material, a greater influence of the personality factors can be 
expected." We shall cite evidence later on to show that this assumption 
may have a questionable empirical base. 

The foregoing discussion presents the stimulus model guiding early 
theorists and interpreters of projective techniques. Naturally the question 
arises as to whether this stimulus model is still adhered to in contemporary 
times. Indirect evidence of the continued acceptance of this model is 
shown by the results of a recent study by Seeman and Marks (1962). 
Thirty-six experienced Ph.D. clinical psychologists were asked to rank 
order tests on the dimensions of stimulus ambiguity, response freedom 
(degree of restriction of the response), and "depth" (availability to 
consciousness). The tests were the Bender-Gestalt, Rorschach, Sentence 
Completion, Strong, Szondi, TAT, and Word Association. The rank 
order correlation between stimulus ambiguity and depth was .86, reflect- 
ing the continued adherence to the proposition that the greater the 
stimulus ambiguity, the greater the depth or revealingness. The correlation 
between response freedom and stimulus ambiguity was .91. Thus the 
concept of stimulus ambiguity seems to imply 
degrees of freedom in response production. Interi 
freedom and depth evidently reflect the same dimensions in the thinking 
of clinical psychologists since the rank order correlation between the two 
variables was 1.00. 

With a projective model of this type, based largely on gestalt and 
analytic principles, it is not surprising that the specific role of the stimulus 
in evoking responses was largely overlooked by early theorists. Content 
and formal analysis of projective protocol data became personality 
centered, with the role of how the stimulus articulates with inner 
psychological systems ignored. In a singularly lucid and picturesque 
manner Rosenzweig (1951, p. 215) summed up this position when he 
said “The stimulus was dethroned. . . .” The stimulus remained in limbo 
until approximately 1950. An analysis of the historical reasons for the 


stimulus remaining in neglected confinement has been presented else- 
where (Kenny, 1961). 


the conception of many 
estingly enough, response 


C. REcE 


NT DEVELOPMENTS FocusiNG ATTENTION ON STIMULUS FUNC- 
TIONS 


The past decade or so has witnessed 


a gradual renaissance of theoreti- 
cal and empirical interest 


in the role that stimuli play in prompting 
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responses to projective techniques, especially for thematic apperceptive 
methods. What factors contributed to the restoration of the stimulus to 
grace? 

Certain developments within psychology as a whole undoubtedly 
provided the strongest impetus to a fresh interest in the stimulus. Two, 
somewhat correlated, developments should be singled out for special 
mention. In the first instance, the examination of the behavioral deter- 
minants in perception, in contrast to the autochthonous factors, helped to 
rekindle an interest in the perceptual aspects of projective techniques. 
Bartlett’s (1932) and Sherif's (1935) studies on autism set the stage for 
this research trend, as did the pioneering studies of Sanford (1936, 1937) 
and Levine, Chein, and Murphy (1942) on the influence of the hunger 
drive upon the interpretation of ambiguous figures. The broad assump- 
tion that needs influence perception was given close attention by the 
studies of Bruner and Postman and their colleagues (e.g., Bruner, 1951; 
Bruner and Postman, 1947; Bruner and Postman, 1949; Postman, Bruner, 
and McGinnies, 1948; Postman and Solomon, 1949) and a legion of other 
investigators. 

The studies on perceptual defense and vigilance soon came under a 
bombardment of criticism, especially for failing to control adequately 
the nature of the stimulus material. Two of these attacks centered around 
the necessity of controlling the relative frequency of word usage (Solomon 
and Howes, 1951) and of ensuring the emotional relevance of the stimuli 
to the subjects (Chodorkoff, 1955). | m 

That the stimulus properties in such experiments are intimately 
associated with the resulting perceptual reactions was hinted at by one 


of the earliest studies in this broad area of investigation. The results 


of Levine, Chein, and Murphy (1942) suggested that the more ambiguous 
Stimuli favored reality processes, whereas the less ambiguous figures 
facilitated the operation of autistic processes. Not only did this experi- 
ment demonstrate the need for careful manipulation of stimulus material, 
but it also questioned the validity of the assumption that the influence of 
need increases with increasing stimulus ambiguity. А | 

ancy of these experiments to projective techniques was 
helped to make clear the fact that any interpretation 
terms of personality dynamics alone would be 
howed unequivocally that the findings were 


The relev 
obvious, in that they 
of projective material in 
grossly deficient. The data s 


partly dependent upon the stimulus. | 
Further developments within psychology also helped personologists 


to look again to the stimulus, namely, the new theorizing on perception 
(e.g, Bruner, 1948, 1957) and the advances in psychoanalytic ego 
psychology (e.g., Hartmann, 1950, 1958). This new theorizing indicated 
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that a "new" look had to be given to the role that stimulus variables 
might play in perception and in projective methods. 

Thus developments within psychology as a whole contributed to a 
renewed appreciation of the importance of stimulus functions in be- 
havior determination. The converge 
ments within projective techniques 
ened interest in the stimuli of these 


research journals in the early 1950's were honeycombed with evidence of 
the failures of the Rorschach and the TAT in personality assessment. 
Kelly (1954), reflecting the growing belief th 


validity, suggested that social psychologists might wish to study the ad- 
herence to tests in the face of so much negative validity. (For the clinical 
psychologist, projective techniques are like theories to many experimental 
psychologists; they will not be given up until supplanted by more valid 
techniques.) The problems encountered in attempting to establish the 
validity of the two standard projective instruments led many investigators 
to attempt construction of their own custom-built projective tests. The 
latter instruments immediately caused the researcher to look at the 


stimulus, and, as a consequence, much new evidence was produced about 


а 
stimulus functions in projective test material. Some of this provocative 
research will be examined in a I 


ater section. On the other hand, a small 
number of investigators had always been interested in determining the 
role that the stimulus played in projective instruments, Some investiga- 
tors, for example, were examining the problem of whether or not “color 
shock” on the Rorschach is attributable to the effects of color by remov- 
ing the color from the chromatic cards, Other investigators were con- 


cerned with the issue of stimulus ambiguity and personality revelation 
in that ТАТ. 


nce of this trend with two develop- 
themselves helped to forge a sharp- 
instruments. On the one hand, the 


at these techniques lacked 


The coalescence of these trends within 
those in the projective area has recentl 
look more closely at the stimulus aspect or input of projective techniques. 
The results of these studies will ultimately be useful in the construction 
of better personality assessment techniqus and will also help clinical 


psychologists and others to understand more fully how projective instru- 
ments function psychologically. 


psychology as a whole with 
y prompted many investigators to 


П. A Set of Stimulus Postulates for Projective Methods 

As the preceding discussion has shown, early attempts to specify the 
stimulus model of projective techniques usually involved weak analogiz- 
ing from psychoanalytic and gestalt theory. Many assumptions about the 


stimulus were implied, but not explicitly stated. We shall now try to 
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make these assumptions more explicit. A set of stimulus postulates will 
be developed, from which it is hoped that specific predictions may be 
derived, and tested. The postulate set requires verification. 

The first systematic endeavor to specify clearly the interpretative 
assumptions, including those relating to the stimulus, was made by 
Lindzey (1952) for the TAT. Similar, but more modest, attempts have 
been provided by Henry (1956), Kagan (1960), and Murstein (1963b). 
These beginnings were expanded upon by Lindzey (1961) for all pro- 
jective techniques in his book Projective Techniques and Cross-Cultural 
Research. However, since these endeavors have glossed over stimulus 
assumptions somewhat, our purpose is to bring into sharper focus the 
stimulus postulates that seem to be reasonable in terms of theory and 
empirical evidence. 

As a starting point, the protopostulate for the entire set is: Responses 
produced to projective techniques are controlled, to varying degrees, by 
by the projective stimulus. 

Researchers and practicing clinical psychologists have pretty well 
abandoned the earlier notion that projective techniques stand majestically 
above the influence of stimulus variables, revealing in pristine purity 
true personality variance. Three major kinds of empirical data have 
helped to fashion the proposition that responses to projective techniques 
are deeply dependent upon the stimulus. Since most of the important 
research will be discussed in later sections of this paper, no attempt is 
made to provide a detailed discussion of it at this point. The diverse 
kinds of data indicating that the stimulus, in part, controls the response 
to projective stimuli are as follows: 


1. Normative Data 


Popular responses on the Rorschach indicate that specific areas of 
this test are well structured, at least for certain age, sex, and cultural 
groups. Moreover, the fact that only 10% of the percepts to the Rorschach 
are “man-made” objects may be attributed to the inkblot stimuli (Ed- 
monston and Griffith, 1958). The general shape and form of the inkblots 
restrict the type of response that may be made to the Rorschach. Similarly, 
common themes to particular TAT cards (Cox and Sargent, 1950; Eron, 
1948, 1950; Lindzey and Goldberg, 1953; Rosenzweig and Fleming, 1949; 
Wittenborn, 1949) support the proposition that responses to thematic 
apperception techniques are influenced significantly by the stimulus. No 
useful purpose would be served by reviewing normative data in this paper. 
We shall indicate briefly some of their implications. 

One of the most important implications of normative data is that 
clinical psychologists must eliminate from the total test variance that part 
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due to the stimulus properties of their instruments. Research is required 
on how best to do this. At the moment, clinicians must perform this deli- 
cate task in a highly subjective manner. A related implication of norma- 
tive data is that an individual's level of psychological functioning will 
vary within an instrument as a consequence of the varying stimulus pull 
of the projective test material. That is to say, it would be unwise to 
interpret all data from a single test as if it were originating from a single 
level of personality functioning. 

'The last item deserves more attention than in has received in either 
research or the clinical application of projective tests. Projective per- 
sonologists have frequently made the assumption that similar responses 


represent similar intraindividual dynamics. However, the data on com- 
mon themes and popular respons 


very questionable. A more reasonz 


consistency in the same class of behavior due to functionally similar test 


ychological functioning than does 
me class of behavior to dissimilar 
test stimuli. This supposition immediately leads one to the representative- 
ness and type of stimuli that should be used within a projective test. This 
topic is discussed under the first postulate. 


2. Connotative Meanings of the Stimuli 


The connotative meaning of the Rorschach inkblots has received a 
great deal of attention recently, with the result that it is now clear that 
there is a great deal of communality in the meanings assigned to specific 
inkblots. Findings from this research show that the percepts to inkblots 
are, in part, anchored to the stimulus properties of the inkblots. 

Historically speaking, the recent rash of studies on the connotative 
meanings of inkblots stems from the hypothesis that specific Rorschach 
inkblots have special symbolic meanings. Card IV, for ex 
so-called "father" card, Card VI is the "sex" card, and Card 
"mother." In early research (e.g., Meer and Singer, 
De Vos and Murakami, 1958) on the mother-father cards, subjects (55) 
were asked to indicate the card that reminded them of father and the 
card that reminded them of mother. "Today, however, Osgood's semantic 
differential (Osgood, Suci, Tannenbaum, 1957) has become the popular 
device to investigate not only the father-mother hypothesis but also other 
stimulus values of the Rorschach (Little, 1959; Loiselle and Kleinschmidt, 
1963; Rabin, 1959; Rosen, 1960; Zax and Benham, 1961; Zax and Loiselle, 
1960a,b; Zax, Loiselle, and Karras, 1960). The semantic differentia] has 
also been used to assess the stimulus properties of the Blacky Picture Test 
(Stricker, 1963) and of the Bender-Gestalt (Tolar, 1960). 


ample, is the 
VII indicates 
1950; Taniguchi, 
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The most striking contribution that the semantic differential studies 
have made is the general finding that each Rorschach inkblot has a some- 
what unique meaning. A consensus of the findings on the semantic dif- 
ferential research and the specific studies (Levy, 1958; Mayer and Binz, 
1961; Meer and Singer, 1950; Rosen, 1951; Sappenfield, 1961; Schleifer 
and Hire, 1960; Taniguchi et al., 1958) testing the mother and father 
notions permits the following characterizations of the Rorschach inkblots: 


Card I: ugly, dirty, cruel, large, strong, rugged, active, ferocious 

Card II: happy, strong, active, fast 

Card III: good, clean, happy, light, active, fast 

Card IV: bad, dirty, cruel, strong, heavy, slow, ferocious, masculine 

Card V: light, active 

Card VI: large 

Card VII: good, beautiful, clean, happy, light, delicate, peaceful, 
feminine 

Card VIII: clean, active 

Card IX: strong, active, hot 

Card X: good, beautiful, clean, happy, light, active, fast 


While the foregoing data show that individual inkblots tend to 
instigate somewhat specific connotative reactions, they do not, however, 
indicate what specific properties of the inkblots determine the reactions. 


3. Experimental Man ipulations 


Much attention has been given to the general problem of how pro- 
jective responses change as a function of experimental manipulation of 
the projective stimulus. Researchers (e.g., Framo, 1952; Horiuchi, 1961; 
Stein, 1949) have impoverished Rorschach inkblots physically by tachis- 
toscopic techniques. They have shown generally that the form quality of 
whole responses (IV) and percentages of usual detail responses (D) increase 
with longer exposure times. Another kind of stimulus manipulation of 
the Rorschach has involved alteration of some assumed critical property 
of the inkblots, such as color. Since this type of study will be reviewed in 
depth later, it suffices to say at this point that such stimulus changes do 
produce some shifts in the response. 

Similarly, manipulation of the stimulus properties of thematic apper- 
ception pictures has received a great deal of attention. Several workers 
have examined fantasy production as a function of similarity between the 
a picture and the subject. Investigators have also tested 

a monotonic relationship between stimulus ambiguity 
revelation. Several recent studies have focused on the 
t likely to reveal drive manifesta- 


central figure in 
the hypothesis of 


and personality 
problem of what kind of picture is mos 
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tions. These and other tantalizing questions will be examined in later 
sections. Accordingly, it need only be stated that the research evidence 
shows that there are important interactions between projective test per- 
formance and stimulus properties of the pictures. 

At this juncture in the development of projective techniques, what 
is required is a more precise delineation of these interactions. An ae 
tempt will be made in this Paper to present a framework within which 
the interactions between stimulus properties of projective materials and 
response performance can be intensively explored. In essence, then, the 
following set of postulates attempts to provide this framework. 


A. POSTULATE ONE: PSYCHOLOGICAL 


Processes As A FUNCTION OF 
TEST STIMULUS Моры. 


Similar responses to a projective 
gous dynamics as a test approximate 
stimulus model, 

Following the views expressed by Fiske (1963), three test stimuli 
models may be distinguished in current projective methods. The com- 
monest model today is the pure relative frequency one and is best exem- 
plified by the Rorschach and the TAT. Other perceptual tests based on 
this model are the Behn-Rorschach (Zulliger, 1946), the Holtzman-Inkblot 
Technique (Holtzman, Thorpe, Swartz, and Herron, 1961), and the Z- 
test (Zulliger, 1948). The Picture Story Test (Symonds, 1949), the Michi- 
gan Picture Test (Andrew, Walton, Hartwell, and Hutt, 1951; Andrew, 
Hartwell, Hutt, and Walton, 1953), the Make-A-Picture Story Test 
(Shneidman, 1948), the Four Picture Test (Lennep, 1948), the Children’s 
Apperception Test (CAT) (Bellak, 1954), and the Object Relations Test 
(Phillipson, 1955) may serve as additional examples for constructive 
projective techniques which employ the pure relative frequency model. 

In this model, many different stimuli are п 
personality characteristics and an inference 
characteristic is formed by counting similar response productions from 
relatively different test stimuli. For example, on the Rorschach one counts 
the number of whole responses, major detail] responses, minor detail re: 
sponses, and the like. On the TAT, we count the number of aggressive 
themes, succorance themes, achievement themes, and so on. It is to be 
noticed that the same stimuli are used to assess different perso 
ables and, as a consequence, the counting operation js perfo 
pendent of the stimuli. | | 

A second and increasingly popular approach with 
is to custom-build tests based on the specific relativ 
With this model the investigator selects stimuli for 


test will increasingly reflect analo- 
'5 а cumulative homogeneity test 


sed to assess numerous 
about a particular personality 


nality vari- 
rmed inde- 


projective methods 
ie frequency model. 
the measurement of 
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a specific personality characteristic, and simply counts the number of 
times that the characteristic manifests itself in the response productions. 
Unlike the next model to be described, no attempt is made to order the 
stimuli in terms of their evoking strength for the particular personality 
variable under consideration. In principle, any stimulus which seems 
relevant to the personality dimension under investigation will do. The 
general usefulness of this model was highlighted by McClelland (McClel- 
land, Atkinson, Clark and Lowell, 1953) and his colleagues, who con- 
structed TAT-type pictures specifically designed to engage the achieve- 
ment motive. Kagan (1956) made use of this test model when he 
constructed a special set of TAT pictures to measure aggression toward 
peers. The Levy Movement cards (Rust, 1948) is a perceptual test employ- 
ing the relative frequency model. The use of doll play with structured 
questions around sibling rivalry also employs this model (Levy, 1937). 

A third, but still relatively uncommon, approach with projective 
techniques is the cumulative homogeneity model. The two distinguishing 
features of this model are that the stimuli are ordered by scalogram 
analysis along a continuum of decreasing ambiguity with which they 
measure a single personality characteristic. Since this model assumes that 
the stimuli have exactly the same order for all subjects, any two subjects 
with the same total score should have identical answer patterns. For 
projective techniques, the stimuli would be ordered in terms of increasing 
percentage of individuals manifesting a given variable, such that once 
an individual gives a response reflective of the variable, he should con- 
tinue to give similar responses to the remaining stimuli. Thus the 
stimuli are arranged in order of increasing stimulus demand for a vari- 
able. A subject with a weak tendency for a variable would have to con- 
tinue up the scale of decreasing ambiguity much further than would a 
person with either a moderate or strong tendency. Preferably, a rec- 
tangular distribution of stimulus demand frequencies should obtain for 
such a test, with a minimum of stimulus items of demand frequencies 
near .5 since such items are unstable (Fiske, 1963). 

Guttman's (1950) scalogram analysis was successfully applied by 
Lesser (1958) in the construction of 10 pictures to measure fantasy ag- 
gression. This study provides convincing evidence that the cumulative 
ay be applied to projective techniques, providing 


homogeneity model m | ; 
e the elimination 


that the test constructor designs his materials to ensur 
of stimulus elements that may evoke unwanted response systems. Auld, 
Eron, and Гаа! (1955) were only partially successful in their application 
of Guttman's scaling method to the TAT because the pictures were not 
purified of unwanted stimulus elements. A method basically similar to 


scalogram analysis, but dealing more directly with the concept of a thresh- 
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old, has been employed by Barron (1955) to inkblots. In fact, Barron's 
achromatic inkblots, constructed Specifically to provide a threshold 
measure of human movement (M), would lend themselves quite readily to 
à scalogram analysis. A study of Spivak, Levine, and Sprigle (1958) with 
Barron's series of blots show that the percentage of subjects giving M to 
the blots ranged from .00 to .72, with an average increment of .03 between 
adjacent inkblots. Moreover, their data indicate that a good approxima- 
tion to a rectangular distribution could be obtained with only 18 of the 
original 26 inkblots, with an average step interval of .05. 

After this brief examination of models underlying various kinds of 
projective methods, we may now return to the original proposition that 
similar responses to a projective test will increasingly reflect analogous 
dynamics as the test approximates a cumulative homogeneity model. This 
postulate explicitly asserts that the pure relative frequency and the 
cumulative homogeneity models represent polar opposites on a contin- 


uum of internal consistency, with the specific relative frequency model 
in the middle, 


The published data on projective techniq 
relative frequency model indicate th 
superficially similar responses 
[see Jensen (1959) for a com 


]ues based on the pure 
àt prudence must be shown before 
аге combined to reflect similar dynamics 


prehensive review of the literature on the 
internal consistency of standard projective methods] The vital impor- 


tance of cautiousness is indicated by the results of a rarely cited study by 
Wittenborn (1950). He examined the hypothesis that responses to the 
TAT which are similar to each other should be more frequently related 
to each other than should dissimilar responses. Three separate tests of 
this hypothesis were m analysis, he tested for the inter- 
ап” response role 


or significant interrelationships 


for cards 
response categories were not sig- 
alysis tested for the 
mong "hostile" responses given to cards 7BM, 7GF, 4, 6BM, 6GF, and 
ЗМЕ. The data also failed to support the assumption that hostile feelings 
eported to different TAT cards will be significantly related to each other. 

Confirmation of Wittenborn’s early finding that similar responses 
o different TAT cards reflect dissimilar dynamics is contained in the 
tudies by Child, Frank, and Storm (1956), Lindzey and Goldberg (1953), 
nd Epstein (1964). Employing a group-administered TAT, Child, Frank, 
nd Storm (1956) found that internal consistency estimates for 10 need 


interrelations 


STIMULUS FUNCTIONS IN PROJECTIVE TECHNIQUES 299 


variables ranged from —.07 to .34, with a mean of only .13. Similarly, 
Lindzey and Herman (1955) obtained split-half reliability figures ranging 
from .12 to .45, with 8 TAT cards on six different personality variables. 
In a recent study (Epstein, 1964), conducted under the author's guidance, 
a rank order method was first used to order nine TAT pictures in terms of 
hostility. The initial nine cards (1, 2, 3BM, 4, 6BM, 11, 14, 18BM, 18GF) 
were selected on the basis of experience to represent a range of aggression 
in terms of card pull. Cards 2, 4, and 6BM were eliminated from further 
study because there was considerable variation between subjects in the 
ranks assigned to them. Cards 1 and 14 were designated the low-aggressive, 
cards 3BM and 11 as moderately aggressive, and cards 18BM and 18GF as 
highly aggressive. Previous research and clinical experience would seem 
to support the classification of these six ТАТ cards. Forty volunteers from 
an introductory course in psychology told TAT stories to these cards. 
The stories were scored on Stone's (1956) aggressive content scale, Purcell's 
(1956) internally based punishment score, and his externally based punish- 


ment score. Internal consistency was assessed by intercorrelating the data 


on each of the three sets of aggressive cards for the three aggressive 
measures. Of the nine correlations, only one was significantly different 
from zero. The correlation between the low and the medium aggressive 
cards on internal punishment was 48 (p < .05). The results of this study 
provide further support for the proposition that superficially homoge- 
neous responses to projective tests based on the pure relative frequency 
model may not be assessing similar psychological dynamics. 

While the first postulate leads to the prediction that, in general, a 
projective test composed of homogeneous stimuli will be more sensitive 
to analogous psychological processes, direct empirical evidence is lacking 
on this prediction for projective tests based on either the specific relative 
frequency model or the cumulative frequency model. However, this as- 
sumption has received some highly indirect empirical support from a 
study by Carr (1956) on intraindividual consistency in response to pro- 
jective tests of varying degrees of ambiguity. The major finding of this 
research was that there was greatest consistency in level of inference about 
affect between the TAT and Rorschach and between the TAT and 
Sentence Completion Test. If the Rorschach and Sentence Completion 
test stimuli are thought of as extremes on a continuum of stimulus 
homogeneity, then the finding of greatest consistency between each of 
these projective techniques and the TAT lends indirect support for our 
assumption. This study also gives no support for the hypothesis of an in- 
verse relationship between psychological processes as a function of test 
stimulus dimensions. In other words, irrespective of the kind of stimuli 


employed, test indices of apparently similar psychological processes will 
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either produce positive relationships or no appreciable correlations. The 
correlations will not be negative, as some individuals have hypothesized. 

Summary. Despite the lack of direct evidence bearing on the first 
postulate, there is reason to believe that it will hold up because of the 
test construction procedures implied by the specific relative frequency 


model and the cumulative frequency model. Furthermore, theory and 
research in the fields of aptitude and 


attitude measurement point to the 
reasonableness of the postulate. 


В. PosruLATE Two: FUNCTIONAL. STIMULUS 


Valid interindividual and 
tion of constancy of the 
all individuals. 


Projective personologists have been slow in their acceptance of this 
postulate, perhaps because such a proposition severely complicates no-end 
clinical inferences and research. Research evidence in many different 
areas of psychology has gradually taught investigators at least one im- 
portant lesson about the stimulus in psychological experiments. It is 
that a subject may not be responding to the stimuli that the experimenter 
thinks he is. Hull (1943) was aware of this problem when he distinguished 
between the potential stimuli of à situation and the actual stimuli 
received by a subject. In a somewhat similar fashion, Underwood (1963), 
in a lucid discussion of stimulus selection in verbal learning, has dis- 


tinguished between the apparent or nominal stimulus and the actual or 
functional stimulus. The nominal stimulus is w 


presents to the subject, whereas, the functional stimulus refers, in part, 
to the properties of the nominal stimulus which the subject employs to 


cue his behavior. In many psychological experiments, there may be 
serious discrepancies between the nominal 


rendering any isomorphic assumption betw 
tional stimulus suspect. 

With respect to projective techniques, the nomin 
the psychologist presents to the individual. For ex. 
"My mother . . . ," the scene portrayed in picture 6 
Card VII of the Rorschach are nominal stimuli. For these materials the 
term functional stimuli refers to how an individual perceives the meaning 
of the item stem; how he perceives what is portrayed in picture 6BM; 
and what meaning he attributes to card VII of the Rorschach, Do each 
of these three projective items signify “mother” as a functional stimulus 
for most Ss? The information necessary to answer this query would be 
shown by the degree of consistency of perceptual reactions to t 
jective stimuli. That is, a functional stimulus is specified by 


group comparisons necessitate the assum p- 
functional stimulus on a projective technique for 


hat the experimenter 


and functional stimulus, 
cen the nominal and func- 


al stimulus is what 
ample, the item stem 
BM of the TAT, and 


ese pro- 
the Perceptual 


2 
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decoding of the nominal stimulus by the individual. A simple physical 
specification of the nominal stimulus will not enable the investigator to 
discover the functional stimulus. The notion of a functional stimulus 
is expressed by the following proposition: 

FS — f(Dimensions of nominal stimulus €» decoding schemes) 


where the double arrow indicates that the stimulus dimensions of the 
nominal stimulus and the decoding systems of an individual interact 
continuously with one another. 

The assumption that the FS must be the same for all individuals, if 
valid individual or group comparisons are to be made, may be brought 
into initial focus by dealing first with the problem of cross-cultural com- 
parisons with projective techniques. Suppose we wish to compare n- 
Achievement between the Wisconsin Menomini and Algerian Arabs by 
means of a TAT-type test. Before constructing his test pictures, the 
projective personologist would obtain ethnographic data on the kinds of 
cues in each society which arouse achievement-oriented dispositions. Suc- 
cessful construction of one set of pictures, suitable for both groups, de- 
pends upon discovering trans-cultural cues for achievement arousal. If 
the nominal picture stimuli do not possess trans-cultural equivalence or 
functional stimulus equivalence for both groups, then group differences 
between the Arabs and Monomini could not be attributed to differences 
in achievement drive. That is, differential group perceptions of what is 
portrayed in the pictures would vitiate any valid cross-cultural compari- 
sons. On the other hand, the investigator may construct different sets of 
pictures for the separate groups. In principle, such a procedure is entirely 
suitable, providing his ethnographic data permit him to match the cue 
strength for achievement in the two sets of pictures. Of course, the prac- 
tical problems involved in such an approach are almost insurmountable, 
largely because of the absence of good ethnographic data which would 
enable the investigator to equate different nominal stimuli for functional 
stimulus equivalence. 

Most, if not all, projective personologists would readily grant the 
need for functional stimulus equivalence in a comparison of two widely 
different cultural groups. However, the whole thrust of the principle at 
issue has not been so widely accepted in the comparison of various sub- 
groups within the North American culture. It would be most unfortunate 
if the general principle of functional equivalence of stimulus meaning 
were disregarded when group comparisons were performed between 
groups within the same society, whether the comparisons be between 
males and females, young and old subjects, lower and middle class groups, 
or other similar group comparisons. 


302 DOUGLAS T. KENNY 


Consider, for example, the following problem: Suppose we wished to 
compare the achievement drive of Radcliffe students and home economics, 
students from a state university. It probably would be inappropriate to 
use a picture which presents a scene of a young lady baking a cake as a 
means of assessing the achievement drive differences between the two 
groups. It is most unlikely that such a scene would possess equal cue 
strength for achievement or similarity in perceptual meaning for the two 
groups. Likewise, Card 1 of the TAT probably would not be suitable in 
any valid comparisons of differences in the achievement drive for young 
and old persons, largely because older persons usually do not perceive 
this card as an achievementarousing scene (Atkinson, 1950). 

What is equally important, the second postulate applies not only to 
group comparisons but also to comparisons between individuals. It is not 
possible to assert that one individual has 
variable than another person, unless one can re 
individuals decoded the nominal stimulus into the same functional 
stimulus. That is, valid individual comparisons require the assumption 
that the individuals concerned agree on what they perceive in the projec 
tive stimulus. If one says it is an aggressive stimulus and the other says it 
is an achievement stimulus, then one is not in a position to say on the 
basis of TAT stories that one individual has more achievement or 
aggression than the other person. 

Until recent times, very little attention has been giv: 
of ensuring that the projective stimuli have equivalent functional 
properties before two or more individuals or groups are compared. 
ii be Seem ae gu construction 9f projective stimuli which 

have similz ctional stimulus pri 7 T MM 
make a significant contribution argon nd sree should 
Numerous studies (Lindzey, 1961) have Mike feld. 
А Y ws d i c projective techniques 
in cross-cultural research. While it is not surprising that these studies 
show many group discriminations, the exact meaning of the differences 
remains a puzzling issue. Within the context of this Paper, a basic factor 
producing the obtained differences may be attributable 3o differential 
perceptions of the meaning of the test materials. For example, the Ped 
of a group on the Rorschach to perceive the central Pw бй. Card I as 
human, the figures in Card III as human, and the whole blot ОС iv 
as a winged animal may only reflect cross-cultural] differences + ae 
tion. The form, shading, figure-ground relationships, "et n De Mos 
aspects of the inkblots readily lend themselves to "perceptual er sun 
divergent cultural groups. Cross-cultural differences may merely 5 e о 
A E: ; ; 2 y indicate 
"perceptual biases" and not genuine differences in Personality, As far as 
this author is aware, no attention has been given jn cross-cultural research 


more of one personality 
asonably assume that both 


en to the problem 
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to the problem of determining if individuals from different societies can 
actually perceive certain percepts within specifically designated areas of 
the Rorschach. Until such data are obtained, there is no need to assume 
that cross-cultural differences on the Rorschach originate in personality 
differences. 

Similarly, social class and age differences on the Rorschach for North 
American samples may only reflect differential “perceptual biases” and 
not differential personality functioning. While research (Auld, 1952; 
Haase, 1956; Riessman and Miller, 1958; Stone and Fiedler, 1956) indi- 
cates that there are social class differences on the Rorschach, it remains to 
be shown that these discriminations can be ascribed to personality 
variables, Similarly, studies (Douvan, 1958; McArthur, 1955; Мегой, 
Atkinson, Feld and Guria, 1960) with the TAT indicate that there are 
differences in need-Achievement between the middle and lower classes. 
The differences may be class-oriented "perceptual biases” transcending 
personality factors. We are again faced with the need for research to 
determine if members of different social classes can perceive percepts in 
specifically marked-out areas of the Rorschach. Moreover, the same 
matters are at issue in age and sex comparisons on the Rorschach and 
ТАТ. 

As part of a comprehensive research program on stimulus factors in 
thematic apperceptive techniques, this author and his associates have 
conducted a series of studies to determine if TAT cards have functional 
stimulus meaning for different groups. Underlying the methodology of 
these studies is the premise that when a projective stimulus is presented 
to a subject he will attempt to identify, categorize, or otherwise decode 
the nominal physical dimensions of the stimulus. Such decoding systems 
are the operational specifications of what we mean by functional stimuli. 
lf a person is asked to describe what he sees in card 3BM of the TAT, he 
may say, “This is a person who is broken up over something," or “The 
person is thinking of or has tried to take his own life.” Such reactions 
are attempts to assign the nominal stimuli of a TAT card to decoding 
systems. They represent what Bartlett calls “an effort after meaning” 
(1932, p- 44). While stimulus oriented, most individuals clearly go beyond 
the nominal physical stimulus properties of TAT cards when they are 
asked to describe what they see in each card. By decoding the nominal 
stimulus, the perceiver adds something to the nominal stimulus, thereby 
making the functional stimulus (decoding system) somewhat different 
from the nominal stimulus. 

The first phase of this research required Ss to examine each TAT 
card for a 20-second period and to "describe what is in each picture." 
The next step was similar to a critical incident study in that the card 
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descriptions were assigned to decoding categories that seemed best to 
reflect the codes used by the subjects in assigning functional stimulus 
properties to the TAT pictures. That is, the decoding systems бы А 
formed a priori, but were empirically derived from a detailed analysis о 
the perceptual reactions. In the first studies (Kenny, Harvey, and omes 
1961), a total of 40 hospitalized. psychiatric patients and 40 ша 
control $s were used to obtain the initial decoding reactions to 26 T AT 
cards—1, 2, 3BM, 3GF, 4, 5, 6BM, 7BM, 8BM, 9GF, 10, 11, 12M, 12BG, 
ІЗМЕ, 14, 15, 17BM, 17GF, 18BM, І8СЕ, 19, and 20. The Ss were told that 
the experimenter was interested in having people tell her what they see 1n 
pictures and that during a 20-second exposure of each card they were 
to describe what they saw in each picture. The patients (23 male and 
17 female) were severely disturbed psychoneurotics who were institution- 
alized in a provincial mental hospital, in w 


hich the maximum allowable 
period of hospitalization is 4 months. 


The controls or normals were 
matched with the patients for level of education. The modal number of 
perceptual codes, required to c 


ategorize the reactions of the Ss per card, 
was 8, with 


а range of 6 to 12. Scorer reliability, based on two independent 
judges assigning the perceptual reactions to the codes, was 91.2% agree- 
ment. 

Do severely disturbed neurotics and norm 
ing mechanisms in 
Out of a total of 
Square tests 


als employ different decod- 
assigning functional stimulus meaning to TAT cards? 
216 decoding categories, only 19 showed significant chi 
at the .01 or .05 level between the groups. 
be concluded that the functional stimulus meanin 
normal and neurotic adults manifests high communality, 

The propitious finding of a striking congruence between normals 
and neurotics in their decoding systems should not be generalized to 
other group comparisons. In order to determine whether extreme cultural 
differences make for dissimilar decoding reactions to the TAT, another 
study (Moriya, 1962) compared the decoding mechanisms of Japanese 
and Canadian samples. On the basis of data obt \ 
studies in this series, the five most common perce 
for each TAT card were prepared and the Ss w 


Accordingly, it may 
g of TAT cards for 


ained from previous 
ptual coding reactions 


; еге asked to check the 
perceptual reaction which they thought best described what they saw in 


each TAT card. The Japanese sample consisted of 40 male and. 60 
female students of the Gakushuin University in Tokyo. The Canadian 
group consisted of 30 female and 54 male students from the U 
British Columbia. In general, the results indicated that 
decode the nominal stimulus of TAT pictures in signific 


niversity of 
Japanese Ss 
antly different 
оез not support 
ally stereotyped 


ways from Canadian Ss. Examination of these differences d 
ihe proposition that Japanese Ss decode the world in soci 
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ways. For example, the Japanese proportions for the "suicide" code was 
greater on Card 3BM, but smaller on Card 17GF than for the Canadian 
sample. The Canadian proportion for the "spying" code was greater on 
Card 9GF but smaller оп Card 5 when compared with those of the 
Japanese. On the other hand, some of the results do seem to reflect the 
strong cultural pressures within the Japanese culture upon perceptual 
codes. Without exception, the Japanese sample has larger proportions in 
the "sadness," "aggressive," and “disphoric” codes than does the Canadian 
group. 
It would seem, then, that culture is a significant determinant of the 
functional stimulus in a thematic apperceptive technique. Although the 
data on differences in perceptual codes between subgroups within one 
culture are scant, Veroff's (1961) data imply that blue-collar workers and 
white-collar workers may have differential perceptions of the stimulus 
meaning of achievement oriented pictures. Although problems of inter- 
pretation are posed by his study because no data were collected on the 
specific decoding categories used by his two groups, the finding that » 
Achievement scores based on pictures of blue-collar workers, in com- 
parison with pictures of white-collar workers, elicit more productive 
relationships to other variables for white-collar workers than for blue- 
collar workers suggests indirectly that the coding reactions of the two 
groups are probably different to the blue-collar pictures. In the same 
study, Veroff also showed college-educated women may interpret the 
functional stimulus of a picture of a career work setting differently from 
grade school-educated women. While the high school- and grade school- 
educated females appeared to interpret a career picture in terms of 
achievement, college-educated women viewed this picture as a cue for 
achievement-conflict. This general finding is in accordance with the 
notion that the [unctional stimulus of a career picture is not the same for 
the two groups. 

In the light of the foregoing findings and considerations, any valid 
comparison of two different groups must ensure that the functional 
stimulus of thematic apperception pictures is the same for both groups. 
That is, the decoding systems to pictures must be the same for different 
groups, if the results are to be attributed to personality factors. 

For some obscure reason, cross-cultural research and studies com- 
paring groups within a society continue to be marred by the failure to 
provide methodological control over the functional stimulus. While it is 
invidious to single out any one research from a host of studies as an 
illustration of the confounding effects of the functional stimulus, a 
recent contribution. (Bradburn, 1963) will be used to provide final 
clarification of the conceptual and methodological questions posed by our 
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second postulate. On the basis of previous research, Bradburn hypoth- 
esized that father dominance is associ : 
Since interview data indicated that Turkish fathers are more ар 
than American fathers, he predicted that Turkish junior сше) bs 
compared with American junior executives, would have significantly 


А MEC NADA -ribed 
lower achievement. Both groups told TAT stories to six pictures descri 
in Atkinson (1958) and the data show 


significantly higher achievement scores. 
clusion that the d 


1 1 1 п. 
ated with low achievement in me 


В » had 
that the. American sample ha 
While it was Bradburn's con 


X pictures possess trans-cultural equivalence for 
achievement cues. It is entirely possible in this study that the average 
differences between American and Turkish males are due to the 
Strengths that the six А тегісап-соп" 
groups. That is, the obtained differences 
act that the nominal picture stimuli could be more 


readily decoded as reflecting achieving cues by American than Turkish 


subjects. 


Summary. Research aimed 
establish that projective stimuli еу 
groups. Unless the inv 
stimulus meaning, 


; t 
at between-group comparisons mus 
oke the same decoding systems for ү 
estigator controls for trans-group equivalence О 
his results must be viewed with reservations. 

C. POSTULATE THREE: 


PROJECTION AS 
SIMILARITY 


A FUNCTION or PERCEIVED 

Attribution of tension-producing affects in awareness will increase as 
a function of increasing perceived similarity between the individual and 
the projective stimulus; attribution of tension-producing affects not in 


consciousness will increase as à function of decreasing perceived similarity 
between the individual and the projective stimulus, 


Most recently, the empirical evidence and theor 
investigators (Feshbach and Feshbach, 1963; Fe; 
have indicated that affective attribution is de 


affective state of the perceiver but also upon the nature of the perceived 
stimulus. The following discussion takes as 


its frame of reference a 
recent theoretical article by Singer (1963). 

In a pioneering and ingenious experiment on 
(1933) showed that frightened female children Perceiy, 
as malicious. Murray called this type of affective 
tary projection.” The attribution to a stimulus 
present in the perceiver may be called supplem, 


izing of a trio of 
shbach and Singer, 1957) 
Pendent not only upon the 


Projection, Murray 
€ pictures of adults 
attribution “complemen- 
affect 
entary projection.” 
ty of Supplementary 


STIMULUS FUNCTIONS IN PROJECTIVE TECHNIQUES 307 


projection effects in his experiment, nor did he attempt to determine how 
projection is critically dependent upon stimulus variables. 

In the first of their studies, Feshbach and Singer (1957) used three 
groups, a control a fearexpression group (who were encouraged to 
become conscious of their anxiety feelings), and a fear-suppression group 
(who were encouraged to suppress their affect). In support of their pre- 
dictions, both fear groups projected fear onto a male graduate student 
appearing in a film, with the fear-suppression group ascribing more 
anxiety to the stimulus figure than the fear-expression group. The two 
most interesting features of these findings are that supplementary pro- 
jection was demonstrated and that suppression of affect facilitates 
projection. In another follow-up study on the projection of anxiety, 
Singer and Feshbach (1962) showed that anxiety-aroused normals and 
psychotics attributed more anxiety to a series of human pictures than did 
control groups. More importantly, the specific prediction that. psychotics 
would attribute more fear to a set of pictures labeled as "mentally ill" 
persons than to "never mentally ill" stimuli was supported. 

Since these two experiments and Murray's left partially unanswered 
the question of what conditions generate supplementary or complemen- 
tary projection, Feshbach and Feshbach (1963) hypothesized that the 
type of projection manifested will be contingent upon the relationship 
between the perceiver and the stimulus figure being perceived. Specifically, 
they predicted that supplementary projection will occur to stimulus 
figures viewed as similar to the perceiver, but that complementary pro- 
jection will occur to stimulus figures perceived as dissimilar. In the study, 
young boys judged 16 pictorial stimuli, equally divided among men, 
women, boys, and girls, for the personality characteristics of malicious- 
ness and fearfulness. In accordance with their predictions, the fear- 
aroused children, as compared with the controls, manifested complemen- 
tary projection to the adult male stimuli and supplementary projection to 
young boy pictures. Such results strongly suggest that if the stimulus 
figure is perceived as similar to the perceiver, then direct attribution of 
affect is to be expected; on the other hand, dissimilar stimuli will 
instigate complementary projection. 

The findings from the foregoing experiments indicate that anxiety 
affects will be projected in accord with the third postulate. Will the 
assumption apply to other affects? In an unpublished study discussed by 
Singer (1963), it was shown that college males and females who were made 
hostile attributed more hostility to college-age male stimuli than to a 
12-year-old boy pictorial stimulus. In another study examining the 
interactions between the similarity of the perceiver and the projective 
stimulus, Feshbach, Singer, and Feshbach (1963) had college students 
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describe 26 stick figures and 28 TAT-like two-person interaction ers 
Both classes of stimuli conveyed the impression of hostility. In the T? 
type pictures, one person could be seen as frustrating another. = na 
series of the TAT pictures, five of the situations portrayed the receiver І 
the frustration as а college-age male, and in the other four situaron 
he was a 12-year-old boy. In the other series of the TAT pictures ES 
ages of the stimulus figures were reversed, with the object of frustratio 
and its source similar to those in the first series. While there was pa 
difference between experimentally aroused hostile Ss and controls in 
attribution of hostile affect to the stick figures, both male and temale 
angered Ss showed a heightened increment in the ascription of hostility 
to same-age college male stimuli on the TAT pictures. The experi- 
mentally aroused Ss showed no significant increment in hostility attribu- 
tion to the younger stimulus individuals on the TAT pictures. Thus, the 
major findings of this study warrant the conclusion that hostile attribu- 
tion only occurs to same-age figures and not to stick figures or to younger 
boys. The conclusion provides evidence in support of the proposition that 
supplementary projection is contingent upon the perceived similarity 
between the stimulus and the perceiver, 

In another study in this series, Kiesler 
that college female Ss, who are made hostile but remain relatively 
unaware of their affects, attribute hostile feelings more to 
sons than to similar individuals. This finding 
notion that affects not in awareness w 
stimulus figures. 


and Singer (1963) reported 


dissimilar per- 
is consistent with the 
ill be attributed to dissimilar 


In the most recent of this provocative series of investigations, Fesh- 
bach (1963) showed clearly that supplementary projection takes place for 
a socially acceptable and positive affect, namely, that of "happiness." 
Twenty-nine female Ss were made to feel happy by the joint interaction 
of a monetary reward and successful performance on a jigsaw puzzle 
designed to measure spatial ability. Twelve of the 29 Ss were assigned to 
an affective restraint group by instructing them not to reveal their 
feelings to other Ss. The experimental subjects, plus an additional 27 
females in a control group, then evaluated for happiness 14 photographs 
of male and female pictures which had been Previously scaled by a 
modified Thurstone technique on a continuum of happiness-sadness. At 
least one male and female photograph appeared at each point on the 
scale. The results of the ratings supported the hypothesis th 
affect may be projected in that the afectively restrained 
more happiness to the happy-appearing female photo, 
controls. That supplementary projection depends up 
tween stimulus figure and perceiver was also upheld, 


at a positive 
8s attributed 
graphs than the 
on similarity be- 
as shown by the 


STIMULUS FUNCTIONS IN PROJECTIVE TECHNIQUES 309 


finding that there was more attribution of happiness to female faces than 
to male faces. 

The results of all these studies are, in general, congruent with what 
would be expected in terms of the third postulate. It should be emphasized 
that this assumption should hold even when socially unacceptable affects 
are at issue. A particularly striking example of the independence of this 
assumption from the social acceptability of affects is provided by the 
research of Bramel (1962). In his experiment, 5s were made to feel that 
they were homosexually aroused by male photographs. This procedure 
caused the Ss to overestimate the magnitude of homosexuality toward 
individuals favorably evaluated as belonging to the same social category. 
While the latter type of individual may be assumed to be more similar 
than dissimilar to the perceiver, Bramel (1963), in a partial replication of 
the original experiment, attempted to manipulate more explicitly the 


variable of similarity and dissimilarity. In this experiment, Bramel em- 


ployed the same technique as before to make the experimental Ss believe 


that they had homosexual motivation by having them observe their own 
deceptively high galvanic skin responses when viewing photographs of 
good-looking men in states of undress. Before watching the galvanometer 
dial, the experimental Ss had been led to believe that high galvanometer 
needle readings indicated homosexual arousal. For the control Ss, the 
needle remained at the low end of the galvanometer dial. After all Ss 
viewed and recorded their own need readings for 15 male photographs, 
ated the degree of homosexual tendencies of an individual telling 
stories to four T AT pictures. The actual stories contained no overt refer- 
ences to homosexual or heterosexual behavior. Before listening to the tapes 
of the TAT stories, half of 5s (the "student group") were told that the 
story teller was a student; the other half (the "criminal group") were told 
that the story teller was a criminal who had just been arrested for the third 
time after a series of armed robberies. In addition, all Ss rated themselves 
on homosexuality and how they compared themselves in similarity to the 
story teller. The latter ratings were performed to assess the adequacy 
of the induction of the two independent variables—homosexual arousal 
and the variable of similarity-dissimilarity. Bramel found that the experi- 
mental Ss ascribed more homosexuality to themselves, reflecting the 
adequacy of the experimental arousal of homosexuality; the Ss assigned to 
the student group rated themselves as more similar to the story teller than 
did those in the criminal group, indicating the adequacy of the similarity- 
dissimilarity induction. In conformity with the third postulate about 
projective techniques, Ss in the student group attributed more homosex- 
uality to the story teller than those in the criminal group. Thus, supple- 
mentary projection takes place to stimulus figures perceived as similar 


they rn 
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to the perceiver, even though the attributed conscious trait is undesirable. 

It is evident that the third postulate and its experimental evidence 
have profound implications for projective techniques. The fruitfulness 
of this assumption, over alternative ones, will now be considered with 


special reference to a sampling of related experimental and clinical prob- 
lems in the field of projective techniques. 


І. Hero Assumption 


In the clinical use of thematic apperceptiy 
play, it is sometimes assumed that the hero of Р 
identification figure in doll play reflects the characteristics of the subject 
and that the other figures in the Story or doll play indicate how the 
subject perceives other individuals in his environment. Lindzey (1952) 
considers the hero assumption as a necessary feature of the clinical use 
of the TAT. On the other hand, Piotrowski (1952) has made the assump- 
tion that all figures in a story are equally revealing of the story teller's 
personality. The large-scale research endeavors of McClelland et al. 
(1953) on the achievement motive follow Piotrowski's a 
issue. Surprisingly, however, this import 
little direct research. 

Lindzey and Kalnins (1958) undertook 
problem, making the assumption that 
tween hero and nonhero figures in 


e techniques and doll 
a TAT story or the central 


ssumption on this 
ant problem has generated very 


the only two studies on this 
a distinction should be made be- 


a TAT story. In the first of these 
studies, 30 undergraduate females were asked to evaluate the similarity 


between themselves and each character that they introduced into the six 
TAT stories they told. For the 180 stories, two independent raters agreed 
90% in identifying the hero figure in each story. The main results of this 
study showed that the hero figures were more often perceived as similar 
to self than were nonhero figures. While Lindzey and Kalnins accepted 
this finding as offering modest support for the hero assumption, it must 


be pointed out that their study is not free from methodological criticism. 
In any valid test of the hero assumption versus alternative assumptions, 
it seems necessary to obtain independent evidence of the attributes of the 
subjects and then assess the extent to which these attributes are ascribed 
to heroes and nonhero figures. Without this kind of data, the Lindzey and 
Kalnins study is most difficult to interpret. If independent data on the 
story teller are obtained, then the hero assumption would predict that 
hero figures in TAT stories would be ascribed more of the story teller’s 
attributes than would nonhero figures. Piotrowski's assumption would 
have to predict that these attributes be attributed equally to all figures. 
On the other hand, our third postulate would predict that attribution of 
traits would depend critically upon the perceived similarity of the story 
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teller to the stimulus figures in the TAT pictures and the degree of con- 
sciousness of the attributes being projected. This type of experiment has 
yet to be done. 

In the second study, Lindzey and Kalnins (1958) exposed the experi- 
mental Ss to a complex frustration of social motives and compared their 
TAT protocols to those of unfrustrated controls. They hypothesized 
that the critical comparison-between the hero assumption and Piotrow- 
ski’s alternative assumption (ie. that all figures are equally revealing) 
would be for the category of aggressive acts by nonhero figures in the 
stories. They argued that the hero assumption, in contrast to the aterna- 
tive assumption, would predict no increase in aggression between nonhero 
figures, The data confirmed the prediction deduced from the hero assump- 
tion. However, our third postulate suggests that the results for the cate- 
y nonhero figures could be made to vary experimen- 


gory of aggression b 
arity between these figures and the story 


tally, provided the perceived simil 


teller were manipulated appropriately. 
In light of the experimental findings that affects can be attributed 


to persons who are quite dissimilar to the perceiver, the hero assumption 


does not seem warranted in the interpretation of TAT stories. The in- 
ference from our postulate is that the story teller is quite likely to at- 
tribute affects to nonhero figures who are perceived as dissimilar to him. 
This observation should not be construed as implying that TAT protocol 
attempt to distinguish between hero and nonhero 
figures. Quite the contrary. Attributes ascribed to nonheroes, provided 
they are dissimilar to the story teller, are likely to reflect unconscious 
states. Attributes attributed to heroes, provided they are similar to the 
story teller, are likely to indicate conscious states. In essence, clinical 
interpretation of any projective material must be based upon the mean- 


ing that stimuli have for 


analysis should not 


patients. 


2. Assumptions with Respect to Physical Similarity between Stimulus 
ез and Projecting Individual 


to the theoretical and practical problem of what type 
of stimulus figures an individual can most readily identify himself with, 
thereby facilitating maximum projection. Most of the research on this 
problem has been designed to test the premise that there is a monotonic 
relationship between amount of projection and increases in physical 
similarity between the story teller and the figures on apperceptive pictures, 
The reader will recall that Murray's (1943) original division of his pic- 
tures into sets for men, women, boys, and girls assumes that age and sex 
similarity of the TAT figures will facilitate identification and projective 
mechanisms. Since then, Murray's assumption has served as a useful model 


Figur 


We turn now 
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for many other apperceptive test constructors who have designed special 
sets of pictorial stimuli. In this connection, Henry's (1947) set of line 
drawings for Hopi and Navaho children, Thompson's (1949) series of 
pictures with Negro figures, the Michigan Picture Test (Hartwell, Hutt, 
Andrew, and Walton, 1951) for grade school children, the set of pictorial 
cards depicting nuns developed by Lasago y Travieso and Martinez- 
Arango (1946) for the personality assessment of nuns, and Kagan's (1956) 
13 pictures to test aggression in boys between the ages of 10 and 13, may 
be cited as representative of authors who have, 
cepted the Murray model. 


Although a considerable amount of energy has been devoted to an 
examination of the assumed relation between physical similarity and 
projection, the obtained data present a problem in interpretation. In 
order to measure amount of projection attributable to differences in 
physical similarity between the projector and the stimulus figures in the 
pictorial stimuli, experimenters have commonly employed as their 
measures of the dependent variable, indices of story length, word counts, 
number of different themes, transcendence scores, level of productivity, 
and compliance with test instructions, However, there is little in the way 
of theoretical or experimental justification for the assumption that word 
counts and the like are valid or relevant measures of projection. Just 
because one set of pictorial figures evokes a higher word count or tran- 
scendence index than another set is by no means a relevant proof that 
the subjects are projecting more of their attributes to one of these sets. 
Such data, at least to this author, are irrelevant to the problem at hand. 
In order to test adequately the relation between physical similarity and 
projection, it 1$ necessary either to induce personality states experimen- 
tally or to obtain independent evidence that the Ss manifest certain per- 
sonality characteristics. Then, it must be shown that one kind of stimulus 
— | а better reflector T these attributes than another type of 
material. As а consequence of the failure to ai 1 А е 
attributes that individuals might project, the vpn, hi ао, Ша 
to be reviewed is of questionable character. For org; 
the studies will be grouped under the categories of identific 
animals, color-of-skin identification, bodily similarity identifi 
same-sex and opposite-sex identification, However, all studies deal directly 
or indirectly with the problem of the relation between the projector and 
the stimulus figures on pictorial stimuli (see also Murstein 1961 1963b). 

a. Identification with Animals. Ever since Bellak designed ihe Chil- 
dren Apperception Test (CAT), experimenters have been шї ЙУ 
vigorous in testing a subsidiary premise upon v 


п which this test was built. 
Bellak made the assumption that young children would more readily 


by varying degrees, ac- 


anizational purposes, 
ation with 
cation, and 
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identify with animal figures than human and it is this assumption which 
investigators have tested extensively. For obscure reasons, the central as- 
sumption of this technique, namely, that stimuli depicting eating, sibling 
rivalry, primal scenes, toilet settings, sleeping, and aggression evoke the 
cardinal problems of children, has remained untested. 

In an early study with children between the ages of 5 and 10 years, 
Bills (1950) found that rabbit pictures instigated significantly longer 
stories than TAT cards. Moreover, the psychological inferences which 
might be drawn from these two tests appeared to be quite different. In 
Bills, Leiman, and Thomas (1950) showed that 
neither the TAT nor the rabbit pictures produced material which agreed 
with data derived from play therapy sessions. In these two studies, ade- 
quate control was not exercised over the background features of the two 
sets of pictures. 

In the first systematic attempt to control for the stimulus values of 
the human and animal pictures, Biersdorf and Marcuse (1953) had an 
artist construct six animal pictures similar to those on the CAT plus 
another six comparable pictures with humans substituted for animals. 
They attempted to ensure that the two sets of pictures were identical 
in emotional expression and in most other respects. When the two tests 
were given to 30 first graders, no differences were found between the 
lices as number of ideas, number of words, reac- 
tion time, number of figures introduced into the stories who were not 
in the pictures, number of figures referred to in the pictures, and total 
response time. Two main criticisms of this kind of research are worth 
reiterating. First, it has already been pointed out that measures such as 
word count in no way indicate whether or not the Ss are projecting attri- 
butes which they possess. Оп this score alone, this and similar experiments 
do not provide a valid test of. Bellak's proposition. Second, the human 
pictures indicate far more clearly to the Ss the sex of the figures than do 
Bellak's animal pictures. If animal and human pictures are to be matched, 
ambiguity with respect to sex of the figures must be constant. 

Two additional studies have been conducted with the Marcuse pic- 
tures, Mainord and Marcuse (1954) gave the two sets of pictures to emo- 
tionally disturbed boys and girls whose mean age was 7 years. Again, 
simple quantitative indices showed no differences between the two kinds 
of pictures, although five clinicians indicated that the protocols based 
on the human pictures were clinically more useful than those based on 
the animal series. Interestingly enough, however, Budoff (1955) found 
that less well-adjusted children were more productive on the CAT pic- 
tures than on another set of comparable human pictures. In Budoff's 
study, normal children produced more revealing material on the human 


à subsequent study, 


techniques on such ind 
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series. Furuya (1957) administered the Marcuse pictures to 72 Japanese 
children, with an age range of 6 to 12 years. On 20 tests of significance, 
the human pictures yielded better results than the animal pictures for 
four indices. 

Light (1955) compared CAT cards with TAT cards, using 9- and 10- 
year-olds. While no significant differences in number of words occurred 
when the two sets of cards were compared, the CAT cards were more 
productive on a number of other criteria, such as expression of feelings, 
number of conflicts, and themes. Since this study failed to match the 
animal and human pictures on a number of critical stimulus dimensions, 
and contained no data on the personality attributes of the subjects which 
might be projected onto the respective figures, the findings are not per- 
tinent to the general problem at issue. Using a group of intellectually 
superior children in the first, second, and third grade in a comparison 
between the CAT and a somewhat similar set of human pictures, Arm- 
strong (1954) found that the human Pictures elicited a significantly 
greater number of statements that went beyond picture description. 


There were no differences, however, between the two sets of pictures on 
word count. 


In the most recent study on this Proposition, Weisskopf-Joelson and 
Foster (1962) attempted to remove some of the deficiencies contained in 
the foregoing researches. They used 40 kindergarten children, with а 
mean age of 6 years, 2 months, While the CAT is designed for children 
between the ages of 3 and 10, Bellak and Adelman (1960) feel that the 
issue of animal versus human pictures should be tested on young children. 
Many of the other studies have used children close to the 10-year limit. 
In addition, this study avoided the use of a counterbalanced design 
in which all Ss receive each animal picture and its matching human pic- 
ture. Counterbalanced designs may contaminate the results with strong 


differential carry-over effects from analogous pictures In order to match 
four CAT pictures with similar huma 


i in ceu. а Dietures, the investigators were 
forced to modify, probably in critical 


ө Ways, the САТ pictures. For ex- 
ample, two important changes were th 


ш е addition of clothes to the ani- 
mals and the omission of the mouse in Сата 3. While some children 


identify with the lien in Card 8, many take the mouse as the identifica- 
tion figure. While it would probably be inappropriate to design a picture 
with a human figure appearing out of a small hole in a baseboard, thë 
omission of the mouse means that the investigators ave sia: lowes ma 
ining the CAT. It would have been better to use one of the Б 

six cards. The results of this study sh 


: Р owed that the mean trar 
index of the human pictures did not differ significantly 
the animal pictures. 


remaining 
nscendence 
from that of 
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While most of these researches tend to show either no difference be- 
tween animal and human pictures or that the human pictures are slightly 
more productive, the limitations of these studies prevent the acceptance 
of the proposition that children will identify as readily or more readily 
with human figures and will project more onto humans. As already 
stressed, none of these investigations had independent information on 
the attributes which may have been projected. Researchers have yet to 
show sufficient ingenuity in ensuring adequate control over background 
factors in designing their sets of pictures. Research has been largely 
limited to older children. Moreover, some studies have employed counter- 
balanced designs in which going from A to B may not be the same as 
going from B to A. In short, this writer adopts the strong position that 
no study has valid data. This conclusion should be contrasted with 
Murstein's (1963b) assertion that, after reviewing the same studies, he had 
“laid to rest" the assumption about identification with animals. Bellak's 
hypothesis is deserving of more inventive research and treatment than 
it has received up to this time. 

Finally, it should be mentioned that no research has been conducted 
on the relative merits of animal and human stimuli in projective films. 
In contrast to still picture apperceptive techniques which commonly use 
human stimuli, filmed apperceptive techniques have primarily made use 
of puppets (Haworth, 1960; Lerner and Murphy, 1941) and animated 
cartoons (Gemelli, 1951). Apparently, the Test Filmique Thématique 
(Cohen-Séat and Rebéillard, 1955) is the only filmed technique that has 
exploited the possibilities of human stimuli. The research possibilities 
of filmed projective methods, contrasting the projective merits of various 
classes of stimuli figures, are almost limitless. Haworth (1960) should be 
consulted for a comprehensive review of films as a projective technique. 

b. Identification with Color of Skin. The question of whether Negro 
Ss might more readily identify with Negro figures on an apperceptive 
test was raised by Thompson (1949). He hypothesized that sparsity of 
dynamically relevant material in the protocol data of Negroes might be 
due to the white figures on the TAT. Accordingly, he substituted Negro 
figures for the white ones on the TAT and gave the new set of pictures, 
along with the analogous 10 pictures from the original TAT, to a group 
of 26 Southern Negro college students. The students produced signifi- 
cantly longer stories on the Negro series than on the traditional TAT, 
suggesting that projection is facilitated by color-of-skin similarity between 
the story teller and the figures in the pictorial stimuli. 

However, Thompson's modification of the original TAT for Negroes 
was quickly challenged. Riess, Schwartz, and Cottingham (1950) found no 
differences in story length between the Negro and white version of the 
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TAT for 30 Northern Negro and 30 Northern white college students. 
Analyzing the same data, Schwartz, Riess, and Cottingham (1951) showed 
that Negro students produce the same number of ideas to both sets 
of cards. Korchin, Mitchell, апа Meltzoff (1950) contributed another 
study on the two versions of the test. Again, story length did not differ 
between the two versions for 80 Northern Negroes and 80 Northern 
whites. Similarly, Light (1955) did not find any differences in story length 
between the TAT and the "Thompson modification for 26 students. In a 
more appropriate experimental design that avoids the difficulty of the 
counterbalanced designs used in the other studies, Cook (1953) tested one 
half of his Negro and white Ss with the Murray TAT and the remaining 
5s with the Thompson modification. In agreement with other studies, 
Cook found no significant differences due to version of the test. 

Aside from Thompson’s initial study, the main conclusion of these 
studies appears to be that physical similarity of skin color does not in- 
crease projection. However, these investigations suffer from the same 
limitation as those dealing with the animal versus human identification 
problem. In point of fact, there is no evidence that the Ss are attributing 
any attributes to either version of the TAT. Moreover, the assumption 
that any simple manipulation of skin color w 


" à Coen ill elicit perceptions of 
similarity and identification is tenuous, 


to say the least. 


И ОНИЕ ана Войну Similarity. А few investigators have 
attempted to test the formulation that | 


bodily similarity between story 
teller and the central figure on Т AT cards will increase the amount of 


projection. The measures of projection are usually word counts, tran- 
scendence indices, or level of productivity. The principal criticisms of 
these studies concern the correctness of the measures of projection and 
the crude manner of manipulating bodily similarity, As we have already 
indicated, the only way in which ап experimenter can be sure that а 
subject is projecting is to have prior knowledge of the subject's attributes. 
The similarity variable is something that resides within the subiect. That 
similarity is a psychologically mediated variable and not a en cn vari- 
able is illustrated by the finding (Clark and Clark, 1947: odoris 1946) 
that Negro and white preschoolers identify more strongl Nis PM white 
doll than the Negro doll when asked, "Which one looks like vou?” This; 
the kind of person an individual perceives himself Similar ^ фе 
correspond with the experimenter's Ртесопсејуес Notions. For example; 
when an experimenter draws a TAT figure as Obese, there s no warantes 
that an obese person will perceive himself or his body image i simili 
to the obese figure in a TAT picture, This kind of bie ah особе 
phenomenal judgments from the subjects as to the kinds of individuals 
they perceive themselves similar to in specifically defined role situations. 


“I 
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Armed with such judgments, the researcher could then manipulate the 
similarity variable in role settings represented on TAT pictures. With 
these reservations in mind, the three studies which have dealt with the 
body similarity problem will be quickly reviewed. 

Weisskopf and Dunlevy (1952) obtained no significant interaction 
for transcendence indices between type of subject variable (obsese, crip- 
pled, and normal) and type of central figure variable (obese, crippled, 
and normal). In a second study, Weisskopf-Joelson and Money (1953) 
investigated the effects of facial similarity between subject and central 
figure in the TAT on word counts and transcendence indices. In the 
neutral set of nine pictures, the facial features bore no resemblance to 
the subjects. In the experimental set, the facial features of the central 
figures were the photographic reproductions of the Ss’ faces. The results 
showed that facial similarity between the 5 and the central figure did not 
produce a significant change either in word counts or transcendence in- 
dices. Greenbaum, Qualtere, Carruth, and Cruickshank (1953), in an in- 
vestigation comparing TAT pictures and modified versions showing 
stimulus figures as physically handicapped, found that the unmodified 
TAT cards yielded a greater level of depth than the modified set. 

When considered against the bench marks established for this kind 
of study, it must be concluded that adequate experimentation has yet to 
appear concerning the relation between perceived bodily similarity of the 
story teller and the central figure. | А 

d. Identification with Same-Sex and Opposite-Sex Figure. At the level 
of theory, it is often assumed that 5s more readily identify with stimulus 
figures of their own sex. Primarily on the basis of developmental identi- 
fication with the same-sex parent and the sex typing of behavior in 
socialization practices, this assumption appears tenable. Thus, it seems 
reasonable to hypothesize that an individual will project tension-pro- 
ducing affects of which he is aware onto the same sex stimulus. However, 
tension-producing affects of which he is unaware should be projected 
onto opposite-sex figures. Does this twofold supposition conform to the 
experimental findings? 

In an early use of a filmed projective technique, McIntyre (1954) 
concluded that a viewer does not increase projection to the like-sex film 
protagonist. A similar finding was obtained by Silverstein (1959), who 
administered two parallel sets of male and female apperceptive pictures 
and a shortened version of the Edwards Personal Preference Schedule to 
40 male and 40 female college students. The thematic material and the 
Edwards schedule were scored for affiliation, aggression, autonomy, defer- 
ence, exhibition, nurturance, SCX, and succorance. The correlations be- 
tween the Personal Preference Schedule scores and those from the the- 
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matic apperceptive pictures for the samesex and opposite-sex figures 
were, in general, not significantly different from zero. Two major con- 
siderations vitiate any firm Conclusions from these studies. Both used 
self-questionnaire data as the basis for information about the attributes 
of the subjects. Furthermore, the Ss in both studies did not seem to have 
identified with either sex figure in the projective material. 

Moore and Schwartz (1963) have recently reported a comparison be- 
tween two modified versions of the Rosenzweig P-F Study in which the 
manipulated. Each version of the test 
ith Form I so designed that a male figure was 
artoons and a female figure was frustrated in 
male figures were frustrated in the 
rty-eight males and 59 females re- 
females responded to Form II. In 


first 12, and males in the next 19. Fo 
ceived Form I and 59 males and 51 

contrast to the TAT, the p.c Study requires the § to identify with the 
frustrated person and to provide an appropriate response for him. The 
seven Rozenzweig scoring Categories, when applied to the frustrated 
figures of the same sex and to those of the opposite sex revealed по 
significant findings. The authors concluded th О 
assumption of a relationship between 
torial stimuli. However, si 


heir Ss, their 
results are not germane to the assumpti 

While the preceding studies hav 
sex and opposite-sex identification, 
doubt that sex of the stimulus figure 
jection. 


€ not yielded valid evidence on same- 
there are others which leave little 


15 related to identification and pro: 


udy in which they tested 


like-sex character in a Hollywood 
was thought to provide comparabl, 
a boy or a girl as the identificand. An identification index showed that 
the male viewers identified with the boy character and female stendi 
with the female character. Comparable results in another tudy (Albert, 
1957) strengthen the supposition that Ss identify MOst stri : ly with 
same-sex stimulus figures. ongly 

However, it is possible for the ind 
of dissimilar sex. While sex identity undoubtedly helps 
ceived similarity between the subject and stimulus figures, similarit 
figures are embedded, the sex typing of the behavior 


i portrayed by the 
p 5 'ed consonanc қ 
stimulus figures, the perceived € between the motives and ac- 


ivi А А 
idual to choose ап identificand 


to mediate per- 
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tions of the stimulus figures and the subject, and the general adequacy of 
appropriate sexual identification within the subject are four important 
factors which we assume to mediate perceived similarity. These four 
variables must be taken into account before any adequate predictions 
can be made about identification with same-sex and opposite-sex figures. 
For example, if a male stimulus figure is engaged in sex-typed female 
activity, then a male subject may identify with an opposite sex figure, par- 
ticularly if she is engaged in masculine activity. We also suspect that the 
identificand may be dissimilar in sex to the subject when tension-pro- 
ducing affects are not in awareness or when the subject’s strongest devel- 
opmental identification is to a member of the opposite sex. 

It is these considerations that may help to clarify two of the most 
challenging and puzzling problems about achievement motivation in 
women. As a general background to research on the achievement motive, 
it should be mentioned that males have served as Ss in most of the 
studies that have demonstrated lawful relationships between achievement 
motive scores and performance variables. Studies on females have been 
few in number, largely because the preliminary research with females 
yielded equivocal data. As Lesser, Krawitz, and Packard (1963) have in- 
dicated, a comprehensive theory of the achievement motive should be 
able to incorporate the dynamics of achievement for both sexes. A theory 
of motivation, based on the findings of one sex, would be limited to ap- 
proximately half the population. 

In 1958, the two enigmas about the findings on the achievement 
motive for females were (а) why females did not increase in achievement 
under achievement-arousing operations, whereas males did; and (b) why 
women produced more achievement to pictures of males than to pictures 
of females. A decade ago, McClelland et al. (1953) reviewed three studies 
which showed that females did not show the expected increase in achieve- 
ment as a consequence of achievement-involving instructions. Studies by 
Veroff (1950) and Veroff, Wilcox, and Atkinson (1953) indicated that 
female high school Ss and female college Ss manifest more achievement 
to male picture cues than to female ones. This finding occurred under 
neutral and achievement-oriented instructions, although neither study 
produced a significant increase in achievement motivation under arousal 
conditions. However, an examination of three recent studies with female 
Ss which have shown an increase in achievement with achievement-ori- 


ented instructions suggests an important interaction effect between the 
achievement motive and picture cues for females. 

Angelini (1955) (reported on by Lesser et al., 1963) reported that 
competitive Brazilian college women showed a significant increase in 
achievement motive scores after achievement-arousal instructions which 
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Stressed appeals to leadership ability and intelligence. More significantly, 
perhaps, the Brazilian college women obtained higher achievement scores 
to pictures of males than did Brazilian ¢ 
It is important to note that Angelini used only male card stimuli. a 
Alper (1957) showed that the sex of the central figure in a TAT саго 
has an important influence on achievement for females. In her study, 
three groups of college girls were used, a group composed of 28 leaders 
who were tested under neutral instructions, a second group of 24 цол 
leaders who were tested. under €go-oriented instructions, and a third 
sample of 24 nonleaders who Were tested under relaxed conditions. Card 
І (boy and violin) of the TAT and Card Ше (young girl in empty class 
room) from the Michigan Picture Test were used in the data analysis: 
5, Alper found that. the nonleader ego-oriented 
higher achievement Scores than the relaxed non- 
5 for the male and female pictures showed (a) the 
leader group produced more achievement to the male picture than to the 
differences between the two pictures occurred 
ed nonleaders or the nonleader relaxed group: 
oduced higher achievement score 
axed group, as it also did 
arison with the nonleader 
iced a Significantly lowe? 
for the nonleader group: 
in Comparison with the 
ale picture produced signifi- 
the relaxed nonleaders than for the 
п of results indicates that a male picture 


More than а fem: ; female 
a Ё emale for either fema 
leaders or women who have been given Alive алчудан унтуп на. 


Since Alper did not have an achievement-aroused leader 
direct comparison with a nonaroused leader 
replication with the addition of an ; 
The Lesser et al. (1963) study explored 
that the failure of past investigators to 
experimentally in females might be due 
who have been used were not overly concerned with "e. ih 
3 ki " i а ‚ iro 
means of intellectual skills. To test this hypothesis. they used o j es 
i j igh sc for i ini 
and seniors from a high school for intellectually gifted girl “a d 
i is ci itive, а s. The ad- 
mission to this school is competitive, and, once Selected the stud » n 
1 l in a highly competitive environment Which err 1c en 5 
placed T : The $ . Paces Stress OW iniel. 
lectual and scholastic achievement. The Ss were divideq inis vts death 
i i 4 achievers atci 
groups of 40 achievers and 40 оа АП Ss were exposed to 
$ achiev -ori instructions. 2 
neutral and to achievement-orientec hs each of к, 


ә - of males. 
ollege men to pictures of male 


achievement score for the le; 
as it also did for the €go-oriented nonleaders 
nonleader relaxed group; and (Ч) the fem 
cantly more achievement for 

leaders. This complicated patterr 
is likely to arouse achievement 


resting hypothesis 
: achievement motive 
9 the fact that the women Ss 


Under 
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ditions, Ss wrote TAT stories to three pictures in which a male was the 
central figure and to three pictures in which a female was the central 
figure. While the main hypothesis of the study that the achievement 
se in achievement motivation was not 


instructions would yield an increa 
ntal treat- 


confirmed, a significant interaction among groups, experime 
The interaction showed that "achieving 
girls did display the expected increase in achievement motivation scores 
under achievement orienting conditions, but only when responding to 
stimuli depicting females; underachieving girls also displayed the ex- 
pected increase in achievement motivation scores under achievement 
orienting conditions, but only when responding to stimuli depicting 
males" (Lesser et al., 1963, p. 63). The authors applied a social role inter- 

They made the assumption that achieving 


pretation to their results. 
females conceive of intellectual achievement as an important part of their 


role, whereas underachieving girls perceive that the pursuit of intellectual 
achievement is part of the male role. In addition, they made the assump- 
tion that the attribution of the aroused motivation of achievement in the 
underachievers would be directed to male figures who are perceived as the 
achievement. expressions. Thus, achieving girls will 
ent to female figures and underachieving girls will 
attribute achievement to males. This social role explanation of the 
obtained results is most plausible and could be checked by obtaining role 
expectations that achieving and underachieving girls have for males and 
females. On the other hand, our third. postulate would propose that 
achievement strivings are less in awareness for the underachievers than 
for the achievers. If this were the case, then it could be predicted that 
underachieving girls would project more of their achievement strivings 
onto male figures than female figures. — | | | 

Summary. There is strong experimental evidence showing that in- 
dividuals project their affects 1n accordance with the third postulate. 
The application of this postulate to the hero assumption and the assump- 
tion with respect to physical similarity between stimulus figures and the 
projecting individual was examined in great detail. Unfortunately, most 
of the studies on these assumptions suffer from methodological weak- 
nesses. However, the evidence does indicate that type of stimulus figure 
on the TAT plays а critical role in controlling type of attribution. 


Four: PERCEPTUAL DYSFUNCTIONING AS A FUNCTION OF 


ments, and pictures was found. 


proper agents for 
attribute achievem 


D. PosrULATE 
PROTUSIVE STIMULI 

Acutely protusive stimuli will disrupt perceptual and cognitive 

ctions in individuals with weak decentering ability. 


organizing fun - 
arch on the development of perception and cog- 
1 8 


Contemporary rese 
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е immediate stimulus environment. At birth and 
» obj possess no symbolic meaning, and 81 
У in terms of their color and shape. Pany 
y is immediate and spontaneously directed towarc 
8 perceptual maturity, however, the 
impact of the immediate stimulus "pon perceptions and cognitions be 
comes less and less, Proximal stimulation becomes subordinated to 
decoding systems and to abstract schemata. When a child has reached 
the stage of concrete and formal Operations, he is no longer under the 
Sway of the salient features of stimuli. He can decenter or refocus ОП 


various aspects of the immediate external and internal environment by 
highly flexible and balanced modes of perceiving. 

From these notions, it follows that 
systems and good decentering ability 
that permit €asy accommodation to 


in 
a large number of inei 
are the two most important factor 


Protusive stimuli, While what con- 
stitutes a protusive stimulus for any individual can only be discovered by 


experimental techniques, it is Suggested that any stimulus that cannot 
be readily assimilated into decoding Systems is likely to function as 4 
protusive cue, Kreezer's (1958) threshold technique for measuring the 
attention demand value of à stimulus is one possible operational method 


for determining protusive stimuli, Bright colors and affectively charged 
or threatening figures are tw imuli which may be pro- 


tusive for many Tode x F decentering ability, such 
stimuli would, ene be | мн existing decoding 
system. On the other han , indiy mal decentering ability 


a decoding system. In 
raise the individual's 


‘sort to an early mode 


of perceptual responding, that is, by ately and. directly 


toward the salient features of protusive st 


‚зе effects of Protusive stimuli are 
not likely to be shown on projective techniques unless individual 
differences in decentering ability are taken into account, IF th 
ing variable is overlooked in an experimenta] design, then th 
protusive stimuli will not be manifest, | 

The bridge between these theoretical notions and Ptojective methods 
will be limited to the possible effects of bright color as Rb. 
protusive stimulus. As applied to the Rorschach, į 


lysfunction;,. 8 Predicted that 
protusive colors may cause perceptual dys unctioning j 


n individuals with 


е decenter- 
е effects of 
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weak decentering ability. The perceptual dysfunctioning is probably best 
assessed by combining the C and CF Rorschach determinants. Several 
factor analyses of the determinants have shown that the Rorschach’ does 
measure a lack of perceptual control factor. In. both psychiatric and 
undergraduate students, Wittenborn (19502, b) isolated a factor that 
involves a spontaneous perceptual approach to inkblots. The high 
loadings on С and CF indicate that spatial features of the inkblots are 
being minimized. The clustering of C and CF on a lack of perceptual 
control factor was also confirmed by Williams and Lawrence (1953). 
Inasmuch as the FC determinant did not participate in the loadings on 
this factor, it would seem appropriate to treat the personality implica- 
tions of the FC response as different [rom CF or C responses. 

Unlike many other hypotheses about color as a protusive variable, 
the present formulation is not rooted in the assumption that color 
evokes affective responses. In essential agreement with Schachtel (1943), 
Rickers-Ovsiankina (1943), and Shapiro (1960), the present hypothesis 
asserts that primitive color percepts represent a more immediate form 
of perception and is indicative of perceptual dysfunction. In basic 
agreement with Shapiro (1960), the present formulation remains theoreti- 


cally neutral about the relation between color and affect. 


1. Evidence That Primitive Color Percepts Represent an Immediate 
Perceptual Mode and Are Reflective of Perceptual Dysfunctioning 


While factor analyses of the Rorschach determinants indicate that 
protusive chromatic stimuli may produce а lack of perceptual control 
factor, is there more direct evidence to substantiate the hypothesis that indi- 
viduals scoring high on such a factor manifest an immediate and 
Spontaneous approach to the environment? А few studies (Mann, 1956; 
Singer and Spohn, 1954; Clark, 1948) have provided data that support 
the notion of a linkage between color responses and a spontaneous mode 
of perceiving. Research (Bills, 1954; Costello, 1958; Kohler and Steil, 
1953; Wittenborn and Holzberg, 1951) has shown that depressive indi- 
viduals make little use of color, indicating a lack of spontaneous and 
immediate reactions to protusive stimuli. In their factor analysis of 
Rorschach determinants and MMPI scales, Williams and Lawrence 
(1954) also obtained a significant negative correlation between the 
Depression scale and CF and C determinants. Superficially there was a 
finding in this study which would appear to contradict the present 
theoretical expectations, if ego strength is viewed as a reflector of a 
perceptual capacity to synthesize and inhibit excessive stimuli. This 
finding was the correlation of .50 between CF scores and Barron’s (1953) 
Ego-Strength (Ёз) scale. However, the construct validity of the Es scale 
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is open to question. For ex 


ample, Gottesman (1959) found significantly 
higher Es scores for delin 


quents over normals. Barron (1956) eer 
found that Es Scores correlated. with à cluster variable indicative 0 
aggression. Moreover, in his original study Barron (1953) found that 
Es scores were Positively correlated With the variables of outgoingness 
and spontaneity and negatively with introspection, Such data, if anything; 
indicate that th ations between Es and CF support the 
2. Experimental Manipulation of Color on the Rorschach 


The finding of Brosin and From 


individuals manifest “color shock” 
whole question as to whether color 


Our hypothesis Suggests that color should have some effects. Since the 
early experimental endeavors of Wallen 


49), 
Я 3 (1948) and Lazarus (1948, 1949) 
many investigators have studi - 


ч : atic and 
є с died color effects by comparing chromatic ат 
achromatic versions of the 
Since numerous other 


Stiff, 1951, 1959. Allen, Stiff 


; rotic 
m (1940) that color-blind ae 
Was the first report to raise v 
3 eem nri 
influences response performan 


and 

1951; Allen, Manne an 
ker 

ы, A > 1953; Barnett, 1950; Вике 

and Williams, 1951; Dubrovner, Von Lackum and Jost, 1950; Perlman, 

a 

951: Sa eld 2 ч : Jost, 1950; 

1951; Sappenfield and Buker, 1949; Swartz, 1953) have followed the 

general methodologica] approach of Lazarus (194 

mental design merits critical evaluation, It involved (a) the administra- 

tion of the standard Rorschach inkblots and their achromatic matches in 

a counterbalanced order to tw г $, апа (b) а statistical com- 

" » ч а 51: E e 
parison of the standard chrom and the achromatic version on 
: a а 

various scores that are thoy atic of color listurbante- 

а color dis 

мир И " н . 5 against the test-retest design: 

of which the following im ра агу crucial: (a) the assumption that 

an AB order of presentation Involves the P 


Огу and set effects as 
and (b) since color 
5, depending upon à 


olor and Personality structure, the 


Many critics have raised c 


analysis of data based upon only leu t.a carum 
cancellation effect. While recent inv ауе üsuall ый be А а the 
counterbalanced design, only a few have attempted to "s "ein account 
the possible interaction effects among kind o color, тебе btivit ў" color, 
and personality structure. It is for this reason that ace rs research 
studies manipulating color on the Rorschach Бетен are not 
pertinent to the personality аш Perceptu; theses about the pro- 
tusive role of color. In view of this deficien Y, no ?Uempt will be made 


to discuss most of these studies in апу great detail, However Baughman 


ор 
го 
Gt 
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(1958) should be consulted for the details of the studies conducted before 
1958. 

a. Studies with Normal Subjects. With the preceding reservation in 
mind, a brief review will be provided of the studies that have investigated 
the influence of color with normal Ss. Using 100 high school seniors, 
Lazarus (1949) found few differences between the achromatic series and 
the chromatic series. He did, however, find a higher F% and number of 
space responses on the noncolor cards. There was also a reduction in 
the number of D responses to the achromatic version. Since this experi- 
ment, several similar experiments (Allen, 1951; Allen, Manne and Stiff, 
1951, 1952; Barnett, 1950; Canter, 1951; Dubrovner, Von Lackum and 
Jost, 1950; Grayson, 1956; Meyer, 1951; Sappenfield and Buker, 1949) 
with normals have shown few or no differences in response performance 
between chromatic and achromatic versions of the Rorschach. 

However, two recent studies indicate that color does have an 
influence on Rorschach performance. Exner (1959) investigated the role 
atic color on Card I of the Rorschach by 
atic versions of this card by the 
The Ss, male and female students, 


of chromatic and achrom 
constructing three different chrom 
monochrome-dye coupling method. 
were divided into four matched groups of one hundred cases each. The Ss 
administered a different version of Plate I. The 
d were blue, green, and brown and the 


e standard Card I. The data showed that the 


in each group were 
chromatic versions of this car 


achromatic version was th 
blue card yielded the greatest number of responses, the shortest reaction 


time, and significantly more wholes, small details, human movement, 
whole human figures, object content, “bell” content to Beck's Dd24, snow 
content, and animal face content than the other version of Card I. While 
it is a little disappointing that Exner did not include a red version of 
Card I, most of his data on the blue card is in keeping with the finding 
that blue is relaxing (Wexner, 1954), and produces less cortical and 
autonomic arousal than red (Gerard, 1958). The green card produced 
significantly fewer form determinants than any other version of Card 1. 
"These findings strongly support the conclusion that different colors have 
dissimilar effects. This conclusion, as Exner indicates, supports Crump- 
ton's (1956) finding that a color perception on one Rorschach card may 
not indicate the same interpretation as а similar response on a different 
plate, and reaffirms the implications of the first postulate. 

In the most comprehensive analysis of the stimulus structure of the 
Rorschach to date, Baughman (1959) prepared six different modifications 
of the standard Rorschach inkblots. His findings on color-achromatic 
comparisons will be limited to a discussion of plates 11, 111, VIII, IX, 


and X. On Plate II, the chromatic series yielded fewer movement and 
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human percepts but more blood responses to the D2 area than any of the 
other card versions. On Plate ПІ, color increased anatomical responses 
to the D$ area and lowered the number of responses to D2. Content 
analysis of Plate VIII showed that color depressed bear concepts, and 
catlike and landscape responses to the D1 area, and facilitated the 
butterfly, flower, and ice cream percepts to the D2 area. The chromatic 
series of Plate IX produced few minor details, but many whole percepts. 
Color also depressed the human percept to the D4 area. The colored 
version of Plate X produced an increased sensitivity to seed percepts to 
the D3 area, caterpillar or worm responses to the D4 area, rabbit head 
percepts to the D5 area, insect percepts to the D8 area, and flower percepts 
to the D15 area. While only a sampling of Baughman’s data has been 
presented, two primary conclusions may be drawn: Color does influence 
Rorschach performance, especially the content categories; and color may 
either facilitate or inhibit responses. 

The question now becomes one of accounting for individual 
differences in response to color. Conceptually, as already indicated, the 
studies of normals do not bear directly on the hypotheses relating color 
perception to personality variables, A design is required that will permit 
testing for the triple interaction effect among types of intrusive and non- 
compelling colors, receptivity to color, and personality structure. 

b. Studies with Neuropsychiatric Subjects. Some researchers (Allen 
et al., 1953; Baughman, 1954; Buker and Williams, 1951; Canter, 1958; 
Crumpton, 1956; Sterling, 1950; York, 1 
the limitations of a normal sample by using psychiatrically disturbed Ss. 
However, studies with a single group of patients are no closer to an 
appropriate design than studies employing a single group of normals. 
Both designs overlook the requirement that interaction effects should be 
built into the experimental design. 

With this qualification in mind, three studies (Allen et al., 1953; 
Baughman, 1954; Buker and Williams, 1951) have shown that the removal 
of color does not seriously influence Tésponse performance on the 
Rorschach. Canter (1958) showed that an unshaded bright color version 
of Plates VIII, IX, and X produced an inhibition of color-dominant 
responses. Accuracy of form perception was impaired in the York (1951) 
study. With a sample of 10 psychotics, 10 neurotics, апа 10 organics, 
Crumpton (1956) showed that color has no influence on color shock 
signs, productivity, reaction time, 8-9-10%, rejections, minus form quality 
of responses, P%, A%, avoidance of red area on Plates II and III, and 
irregularity of succession. However, color did have effects on content 
categories, supporting the findings of Baughman (1959) and Exner (1959) 
with normals. Whatever advantage these studies might have gained by 


951) have attempted to escape 


о 
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the use of disturbed Ss was eliminated by failure to separate the patients 
in terms of personality variables hypothesized to be related to color 
perception. Thus, the findings from these studies cannot be related to 
any of the hypotheses about color on the Rorschach. 

c. Studies with Interaction Effects. Five studies have contrasted 
normals and disturbed Ss on chromatic and achromatic versions of the 
Rorschach. Three (Brody, 1953; Perlman, 1951; York, 1951) of these 
studies have compared neurotics and normals on the two versions of 
the Rorschach and found that the effects of color are minimal or absent. 
While these studies approach the ideal design, they overlook the fact 
that the "crude" classification of normal or neurotic masks the individual 
differences within each of these groups which are likely to produce 
significant effects. It is well known that individual differences in auto- 
nomic and perceptual reactivity in neurotic and normal samples are 
likely to be large. Accordingly, the differences between each of the groups 
are likely to be canceled out, which ensures that they will not differ on 
most dependent variables. Rather than employing heterogeneous samples, 
one should use groups which are high and low on the specific personality 
ibles assumed to be related to color perception, such as affect, 
inadequacy in perceptual decentering, and the like. 

In a study that approximates the ideal design, Holtzman, Iscoe, 
and Calvin (1954) tested for relationships of type of color, anxiety, 
stimulus structure, presentation order, and examiner. Unfortunately, 
the high- and low-anxiety Ss were chosen on the basis of the Taylor 
Manifest Anxiety Scale (1953) and the Winne Scale of Neuroticism (1951). 
This means that the contrasting groups of college women were composed 
of those who admitted to neurotic symptoms and those who did not. In 
anticipation that type of color may produce an effect, the results of 
Plates II and ШІ were analyzed together because their chromatic versions 
are both red and black. Similarly, Plates VIII and IX were grouped 
together because their standard versions involve multicolored pastel 
shades. A balanced experimental design was employed, in which each $ 
received two chromatic and two achromatic inkblots in a predetermined 
order. Contrary to the theoretical expectations of some Rorschach 
workers, the findings did not support the assumed linkage between color 
and anxiety. Differences between the chromatic and achromatic versions 
were minimal or absent. Forsyth (1959) also found no significant dif- 
ferences when high anxious and low anxious male undergraduates were 
tested for differences on Elizurs content anxiety score. Anxiety was 
defined by the Welsh (1956) manifest anxiety scale and the comparisons 
made between à chromatic, an achromatic, and a silhouette 
A significant interaction effect showed that a moderately 


were 
Rorschach series. 
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anxious group produced a low Elizur anxiety score on the colored cards 
and a high Elizur anxiety score on the achromatic and silhouette 
Rorschach series. While the Holtzman et al. and Forsyth studies are 
neatly conceived, their findings are only suggestive and do not provide 
critical evidence on the possible interaction effect 


among color type. 
color receptivity, decentering 


activity, and personality structure. 

d. Studies with Segments of the Rorschach. The effects of color on 
Rorschach performance deserves one final methodological comment. In 
the studies reviewed to this point, the complete Rorschach inkblot has 
been used. There have been, however, a few studies (Siipola, 1950; 
Siipola, Kuhns and Taylor, 1950; Lazarus апа Oldfield, 1955) in which 


segments of the usual details have been used. While results from these 
studies have indicated at least a mod 


est effect of color on response per- 
formance, the results of such studies m 


пау not apply to intact inkblots. 
3. Experimental Manipulation of Color on the 
Color has been shown to have eflects on 


techniques ini four of five stidies, Brackbill (1951) found that neurotics, 
unlike control Ss, told more depressive stories to chromatic TAT pictures. 
Thompson and Bachrach (1951) found that both whites and Negroes 
gave significantly higher words on the chromatic Зери of the ТАТ. 
Lakin (РЗА) aaa fosa significantly higher word counts on a chromatic 
TAT for mentally retarded Ss, as did Lubin and Wilson (1956) for 
handicapped children. On the other hand, color did No: saflivemee the 
transcendence index in the Weisskop{~Joelson and Foster (1962) experi- 
ment when color and its absence Were varied on the CAT In verres 
these studies only demonstrate that color facilitates imd Ж. Loren 
In no way do they provide crucial information on ie ini po зу: 
among types of color, receptivity to color, perceptual кем 
personality structure. 

Summary. Only indirect evidence supports the validity of fourth 
postulate, namely, the finding that color perception . My o ihe € 
mediate approach to the environment. While recen Spresents an i ; 
that color influences response performa t studies have shown 
adequate experimental research rem 
that protusive stimuli disrupt per 
decentering ability. 
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STULATE FIVE: ANXIETY AS А Буусу 

E. Post AS A FUNCTION ок Cry 

Diffuse chiaroscuro stimuli evoke 
which tend to be anxiety arousing. 

Working with Binder's (1933) interpretations "T 


AROSCURO STIMULI 


vague 
= homogeneous perceptions 


aroscuro responses 
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on the Rorschach, Bohm (1960) and others have indicated that the two 
conditions necessary for the occurrence of a homogeneous sensory percep- 
tion are similarity between stimulus elements and indistinct delineation. 
The latter condition is usually met by chiaroscuro stimuli. However, 
both conditions are met by exposing Ss to a homogeneous field (Ganzfeld). 
In order to achieve a Ganzfeld, similar to that described by Hochberg, 
Triebel, and Seaman (1951) and Kenny and Chappell (1963), Ss are 
ups which are cut from table-tennis balls to fit the 
A red light is projected onto the balls. Within 
the initial red perception changes into 
appears to the S that he is being 
and he may describe his 
ike. In short, the 5 


required to wear еуеса 
shape of the eye socket. 
a relatively short period of time, 
ап unpatterned gray visual field. It 
stimulated by diffuse chiaroscuro stimuli 
perceptions as being foglike, cloudlike, or simply gray] 
has vague, homogeneous perceptions. 
In terms of the present postulate, the induction of a Ganzfeld should 
be anxiety arousing. A recent experiment (Kenny and Chappell, 1963) 
has provided empirical support for this notion. Experimental 5s were 
subjected to a Ganzfeld for 20 minutes, and while still under this con- 
dition gave stories to six TAT card descriptions. A comparable control 
also told stories to the TAT card descriptions. The success of the Ganzfeld 
as assessed both by an anxiety questionnaire 
ng their stories and. by scoring the ТАТ 
anxiety questionnaire results showed the 
al Ss to be more anxious than the controls, and the TAT 
findings offered substantial support for the hypothesis that a continuous 
gray field is anxiety arousing. Six out of the 10 TAT anxiety indices 
fferences in the predicted direction. 
riment offers evidence in support of the hypothesis 
that diffuse chiaroscuro stimuli evoke vague homogeneous perceptions 
which tend to be anxiely arousing, it only offers indirect support for any 
anxiety interpretation of chiaroscuro material on the Rorschach. 
However, there is other evidence congruent with such an interpreta- 
tion. In an analysis of the connotative meaning of the Rorschach, Rosen 
(1960) had his Ss make semantic judgments on the Rorschach inkblots 
Rorschach responses. The Ss rated the response 
“cloudiness” l, hazy, cold, and dull. As Rosen indicates, such 
undesirable connotations are close to a description of generalized anxiety. 
Ultimately, the validity of this postulate must be tested either by 
anxiety or by using psychiatrically defined cases of anxiety and 
responses with those of matched controls. In terms of 


as an arouser of anxiety W 
completed by the Ss after tellir 
stories for anxiety effects. The 
experiment 


showed significant di 
While this expe 


and nine common 
as bad, sac 


inducing 
comparing their 


the notion of homo 
he shading material to Binders ChF and pure Ch 


geneous perception, it would be desirable to limit 


the scoring of Ч 
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responses. With such a stipulation, it becomes extremely difficult to ск 
the validity of the fifth postulate through an examination of the research 
relating anxiety effects to the Rorschach, as investigators have used 
diverse scoring criteria for shading. If citation is limited to studies 
that come close to Binder's two scoring categories, the studies of Levitt 
(1957), Cox and Sarason (1954), and Levitt and Grosz (1960) support the 
proposition that shading responses are correl 
findings buttress the present postulates, 
the three studies used different kinds of 
of anxiety. 

Summary. 'There is indirect evidence to support the notion that 
diffuse chiaroscuro stimuli instigate homogeneous perceptions which tend 
to be anxiety arousing. However, a great deal more experimental work is 


required on the present postulate before its validity status becomes 
known. 


ated with anxiety. Such 
especially when it is noted that 
$s and different external criteria 


F. POSTULATE Six: IDIOSYNCRATIC RESPONSES 
ntial cultural experiences, response deviations 


If one rules out differe ‘ 
(idiosyncratic responses) from the perceptual demands of the projective 
stimulus are indicative of significant personality characteristics. 

From a research and clinical point of view, two interlocking ques 
tions emerge from this general proposition: 


l. Is reality-oriented behavior indicated by conformity to the 
demands of stimulus pull? 


ER ы ЧИР ? 
2. Ате idiosyncratic responses more significant than usual responses: 


To both questions, the answer appears to be affirmative. The first 
implies the assertion that the larger the response discrepancy from the 
stimulus pull of the projective material, the greater will be the impair- 
ment in the individual's ability to integrate perceptual and cognitive 
processes. Given the question on the Rorschach, “What might this be?,” 
the individual has to attend to the stimulus, and, to the extent that the 
response is not stimulus bound, the inference may be drawn that 
perceptual and associational functioning are not being integrated. 
Putting this assertion the other Way round, it is assumed that if the S is 
overresponding to his own associations and cognitions, then he is less 
likely to be attending to the reality demands of the оне situation. 
Lest the logic of this assertion be misunderstood, it should be stressed 
that it is not implied that compulsive ad he stimulus ә стени 
ment reflects ideal psychological health of the : 5 curan 
response adherence to stimulus d sycholo i Ron is 
undoubtedly nonmonotonic and cl erted U. Sica а 


ty to integrate perceptual 
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and cognitive processes. Clinically speaking, the three basic factors that 
probably produce idiosyncratic response are minimal decentering ability, 
neurophysiological dysfunctioning, and high levels of arousal in the 
presence of ambiguous cues. Strong motives per se should not impair an 
individual's respect for reality, unless some variable like cortical dysfunc- 
tioning is also present. 

Before examining the research literature on this hypothesis, one 
orth raising. In any rigorous test of the 
postulate, it would be most important to select stimuli that have highly 
predictable responses, that is, stimuli with few associations. If a stimulus 


evokes many different responses, the experimenter will have problems in 
у I » 


establishing criteria for idiosyncratic responses. A control] over the 


number of associations is also important in research which seeks to 
demonstrate that particular kinds of stimuli produce idiosyncratic 
responses in Ss with special kinds of conflict. For example, it has been 
commonly assumed that schizophrenics, relative to normals, produce a 
large number of idiosyncratic responses to emotional words in a word 
association test. Two studies (Deering, 1963; Jones, 1957) have found that 
schizophrenics do give more idiosyncratic associations to emotional 
stimuli than do normals. However, as Deering (1963) and Laffal (1955) 
both stress, the increase in number of unique responses may not be due 
to affect, but simply because emotional words evoke a larger number of 
associations than neutral words. In other words, emotional and neutral 
words should be matched in terms of number of associations they evoke, 
preferably by choosing words with a small number of associations. 
Generally speaking, clinicians make the assumption that unusual 
themes or perceptual distortions to TAT cards are more revealing than 
usual themes or perceptual accuracy about personality. Only one study 
has examined this problem. Miller and Scodel (1955) obtained a usual 
and an unusual TAT story from each of 35 psychotherapy clients. Six 
npted “Биша” matching of the 35 common and 35 


judges then atter ' 
unusual TAT stories to typescript records of the first two psychotherapy 
sessions of these patients. While the judges’ matching was better than 


chance, the unusual stories were no better matched than the usual ones. 
Whether this negative finding provides critical evidence against the 
s open to question, as the findings suffer from the fact that 
ata were used. The definition of usual and 
judgment rather than objective normative 


methodological point is w 


hypothesis i 
only early psychotherapy d 
unusual rested on clinical 


data. 
At the present time, the ratio (F 4- %) between accurate and unclear 


form perception is the best measure of response deviations from the 
perceptual demands of the Rorschach inkblots. It is not within the scope 
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of this paper to discuss the problems involved in specifying what con- 
stitutes good or poor form, although the solution to them ultimately 
hinges upon a statistical criterion of response frequency. Hypotheses 
about the validity of F + % have stimulated some research, most of which 
has involved a comparison of clinical groups. Korchin (1960) should be 
consulted for a brief review of some of these studies. It is sufficient to 
state that the main evidence concerning F+% tends to confirm the 
hypothesis that form accuracy is related to adequacy of perceptual contact 
with reality. Original research is required, however, in which inkblot 
stimuli are constructed with known variations in their difficulty level for 
yielding accurate form percepts. With material conforming to a cumula- 
tive homogeneity test model, the resulting threshold measure of contact 
with test stimuli should correlate significantly with external criterion 


data. Such a measure should also provide a purer measure of contact 
with reality than F + %. 


Finally, it should be noted that the Tomkins— 
ment Test (Tomkins and Miner, 1957) rests explicitly upon the assump- 


tion that rare and homogeneous responses to simila 
indicative of significant personality 


Horn Picture Arrange- 


r test stimuli are 


n Rer і characteristics, While the standardiza- 
tion of this test is excellent, more research is required on the validity of 


the scoring keys for personality. However, the diagnostic power of this 
test should ultimately prove to be high. 

Summary. The validity of this postul 
accuracy data of the Rorschach and, to 
experiments. It is suggested that new ma 
in order to test its validity and for use i 


ate rests primarily upon form 

a lesser extent, word association 

iterial be Specifically constructed 

n clinical situations. 

С. POSTULATE SEVEN: ANXIETY Err 
Gr 


CTS AS A FUNCTION OF STIMULUS 
RALIZATION 


With increasing perceived similarity betwee 
situations or figures and their stimulus тертеѕе 
material, anxiety effects should. increase by gradual increment, 

g e i. 


This postulate rests upon the validity of the phenomenon al samui 
generalization and the assumption that projective techniques will validly 
reflect anxiety effects. The fact of stimulus generalization Жош well 


n high-anxiety-evoking 
ntalions on projective 


established, as does the assumption that anxiety effects are shown by 
projective instruments. Two recent studies (Walker aid Aion 1958; 
Kenny and Chappell, 1963) have shown that experimentally е: Ed 
anxiety is reflected in thematic *pperceptive stories, While d - ability 
of the Rorschach to reflect anxiety depends "pon type of i a bili м 
determinants, Neuringer (1962) concluded, after P ejas 


ат vi M of th i 

а 1 . .TeView пе literature 
that laboratory stress induces lowere productivity, fewer Ww and P ani 
and P a 
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more m responses. Real life stress leads to more F responses and a 
general reduction in the number of responses, M, m, and color responses. 

Data from three studies are relevant to the present postulate. In 
connection with a study on the deviant family background of schizo- 
phrenics, Baxter and Becker (1962) hypothesized that poor premorbid 
male schizophrenics come from mother-dominated homes which are high 
in conflict and aggression and that good premorbid male schizophrenics 
come from father-dominated homes in which the mother was weak and 
the father was tyrannical. On the basis of this and related considerations, 
the authors predicted that poor premorbids should produce more anxiety 
to a maternal figure than to à paternal figure on the TAT, while good 
premorbids should manifest the reverse pattern. Cards 6BM (mother-son) 
and 7BM (father-son) of the TAT were given to 18 poor premorbid 
schizophrenics and 12 good premorbid schizophrenics. The predicted 
interaction between anxiety and parental figure was significant, with the 
poor premorbids showing more anxiety to the mother figure than to the 
father and the good premorbids showing the opposite pattern. This 
supports the notion that anxiety is generalized to pictorial representa- 
tions of the original instigator of anxiety and again reinforces the notion 
that fruitful predictions must take into account the cue properties of 


stimulus figures. 
Two studies by Epstein 


and Fenz are relevant to this postulate. In 
their first study (Epstein and Fenz, 1962), novice parachutists were tested 
vith four levels of stimulus relevance to 
anxiety words. Two parallel forms of 


the word association test were constructed, since the test was administered 
on the day of the jump and ona control day. For eight Ss the control day 
was 2 weeks after а jump and for another eight Ss the control day was 2 
weeks before a jump- Sixteen nonparachutists served as controls. In 
accordance with the notion of stimulus generalization, the results showed 
that the GSR and the mean reaction time in seconds for the parachutists 
nically as а function of stimulus relevance and was 
steeper on the day of the jump than on the control day. In their other 
investigation, Fenz and Epstem (1962) employed the same Ss and experi- 
mental design. However, two sets of TAT-ike stimulus cards were sub- 
stituted for the word association test. The six pictures in each set 
contained four unrelated to parachuting, one slightly related, and one 
strongly related. Congruent with the findings from the word association 
study, the GSR to the initial impact of the pictures increased as a 
imulus relevance and was steeper on the day of the jump. 


function of stt { ег | 
Reaction time on the day of the jump was similar between neutral and 
low-relevance pictures, but was significantly longer on the picture 


on a word association test V 
parachuting and a set of general 


increased monotor 
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strongly related to parachuting. Another commonly used indicator of 
anxiety—frequency of pauses—showed similar results to the reaction time 
data. It should be pointed out that Epstein and Fenz did not employ the 
notion of stimulus generalization of fear to predict their results. On the 
basis of a modified version of Miller's (1948, 1959) model of approach- 
avoidance conflict, they assumed that the gradients of approach and 
avoidance drives may be added, without regard to sign, to obtain | 
measure of conflict-instigated activation. Accordingly, they formulate¢ 
the prediction that the activation gradient will increase as a function of 
increasing stimulus relevance. With the additional assumption that 
activation may be measured by 
the GSR, the obtained results 
Thus, it would appear that th 
either to a stimulus generaliz 


autonomic reactivity indicators, such as 
also conform to their conflict model. 
€ results of both studies could be reconciled 
ation of fear concept or to a conflict model. 
In their first study, Epstein and Fenz (1962) argue that activation is 
shown by the gradient of GSR and not fear, on the grounds that 
parachutists on both the jump and control day produced larger GSR's 
to the high-relevance parachuting words than to the anxiety words. This 
argument is not persuasive on two grounds: (1) The anxiety words йе 
included in the first place solely on the grounds of curiosity, and (2) 
through prior conditioning, fear may be more attached to high-relevance 
parachute words (e.g, target, jump, bailout) than to a priori chosen 
anxiety words (e.g., blackout, injury, hurt). 

Summary. From these three studies 
does generalize as a function 
fear-producing situation 
on projective tests, 


‚ it would appear that uci 
of perceived similarity between the origina 
and the stimulus representation of these events 


H. POSTULATE Еснт: DnivE-SrIMULUS RELEVANCE 


For stimuli which have been scaled for a single drive, the following 
ee (а) When only highly structured drive-stimulus 
material evokes thematic drive content, the individual possesses little or 
mo drive represented by the stimulus material; (b) where minimal 
moderate, and strong cues of the drive material evoke drive content, the 
ROE Se памкан сау йана and preoccupied with the goal of the 
drive; (c) when strong-drive Ss are compared with low-drive Ss, the 
magnitude of thematic drive content differences should gradually increase 
with increasing stimulus relevance to the drive; and ШЕК р T 
drive cues increase and the drive has anxiety attached to it, anxiety effects 
should mount. 

In attempting to provide a theoretica] discussion of how the fore- 


going predictions are derived, the author wil] Present a Жез ЕЙ vercia 
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of a model presented elsewhere (Kenny, 1961). The model makes the 
following assumptions: (a) Proximal stimuli evoke decoding reactions 
which are perceptual attempts by the individual to translate or decode 
proximal stimulation into distal objects; (b) drive stimuli, external or 


internal in origin, will increase the probability that decoding reactions 


associated with the drive will occur, that is, drive stimuli narrow 


selectively the number of decoding reactions that are likely to occur; 
(c) as the number of decoding reactions instigated by stimuli increases, 
the arousal level of the person will also increase; (d) individuals seek to 
maintain moderate levels of arousal, avoiding high or low levels by 
filtering out stimuli that heighten or lower arousal level; (e) unless 
anxiety completely blocks the occurrence of decoding reactions related to 
à drive, anxiety level will interact with internal and external drive cues 
to increase drive-oriented schematic thought; and (f) drives are assumed 
to raise the gradient of stimulus generalization. 

In terms of these assumptions, various predictions may be made 
about how individuals will function on projective tests. If a set of 
projective stimuli has been scaled along а dimension of increasing drive 
relevance, a random sample of Ss should show a gradual increment in 
thematic drive content as drive-stimulus relevance increases. Since drives 
are assumed to raise the entire gradient of stimulus generalization, Ss 
with high drive should give more thematic drive content to cues further 
down the dimension of drive relevance than low-drive Ss. Similarly, if a 
person only gives drive content to highly relevant cues, it may be 
inferred that drive level is low. On the other hand, if drive content is 
present to all levels of drive-stimulus relevance, then the inference may 
be made that drive is excessively high and there is marked preoccupation 
with the goal of the drive. It should be noted that McClelland et al. 
(1958) have made similar statements from an inductive base. 

The preceding discussion has assumed that the drive state of the 
associated with anxiety. What happens to thematic drive 
drive has anxiety associated with it? If the association 
of anxiety with drive cues is exceedingly high, there may be a complete 
blockage of thematic drive content to all stimulus representations of the 
drive, Since such an event is likely to be rare in permissive testing 
the question becomes one of predicting what happens to 
tent when anxiety continues to operate. In terms of assump- 
e), above, highly structured drive stimuli should produce 
content than any other type of cue and should 
provide the best discrimination between low- and high-drive Ss. Of 
course, this prediction is bound to break down in the limiting case when 
cultural training has produced a strong communality of response to 


organism is not 
content when the 


situations, 
thematic con 
tions (b) and (е), а 
more thematic drive 
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stimulus representations of drive states. In such a case, most $s will 
respond in an identical manner, regardless of drive state. 

The implication of the present model that highly structured drive 
stimuli are the best indicators of them 
opposite of that of Epstein and his 
Fenz and Epstein, 1962). Their appro: 
the pivotal assumption that appro 
than avoidance drives as a 


atic drive content is exactly the 
associates (Epstein and Fenz, 1962; 
ach-avoidance conflict model makes 
ach drives have less steep gradients 
direct function. of increasing stimulus 
relevance, and, as a consequence, approach or goal-relevant responses 
should increase to stimuli of low relevance and decrease to stimuli of 
high relevance. Our survey of the literature will investigate the validity 
of these two opposing predictions, as well as the other hypotheses. 
Several studies have been concerned with the interaction between 
drive-stimulus relevance and drive level. It is difficult to evaluate the 
results of these studies because the stimulus properties of the projective 
stimuli and the precise range of the drive level are not clear. A strongly 
structured card for one researcher may be a moderately ambiguous card 
for another. What one investigator may call strong drive, another may 
call moderate. Until terminological differences between investigators are 
resolved and a methodology of defining drive-stimulus relevance is 
standardized, the task of comparing the findings of different studies is 
almost impossible. In spite of these problems, it does seem possible tO 
draw tentative conclusions from a review of the studies in ae area. 
1. Drive-Stimulus Relevance and Aggression 
Some doll play data, analyzed by Wurtz 
support the prediction that anxiety facilitates the express; f aggres- 
sion. Making the assumption that more anxiety is Бат б x. рп 
is expressed against parents than other children Wurte | à Ed that 
more direct and violent aggression will be press г S pum arent 
dolls than child dolls. The doll play results of 150 en ы P t 
the parent dolls were the objects of more Violent a * à ы child 
dolls, supporting the general Proposition that anxiet ЕЕ ыа inm i уе 
to increase drive-oriented schematic thought, iussi d 
In an attempt to evaluate the influe aternal eer 
on aggression, Weatherley (1962) divided female underer: лыда » 15 
into two groups of 50 in terms of mothe SIRE ANS odds 
which they permitted their children to EXPiess Apone ae degree to 
Half of the Ss in each group told TAT stori Beression as a child. 


ез after they } 
; 7 i MAES had been aroused 
to aggression by insulting and depreciating comment. i 1 Е n em н 

Е € other ha 1r 


each group not being exposed to arousal Conditions, Two 1 Is of 
A H e: y 8 + wo 'els О: 
aggressive stimulus relevance were used, with cards 8BM à ТЕСЕ 
and e" d 


(1960), may be used to 
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representing high cues and 2 and 6GF of the TAT representing low cues. 
The analysis of the data yielded a significant triple interaction among 
and stimulus relevance. А 


the variables of permissiveness, arousal, 
maternal-permissive 


breakdown of this interaction effect showed that high 
$s manifested a significant increase in thematic aggression to high 
aggressive cue cards when they were under arousal conditions. Low 
maternal-permissive Ss did not show this effect. If it is assumed that 
maternal permissiveness provides better opportunities for the reinforce- 
ment of aggression than does low maternal permissiveness, then the 
finding on the high maternal-permissive Ss confirms the theoretical 
expectations that highly structured drive stimuli are the best indicators 


of thematic drive content. Kagan's (1956) study of 118 elementary school 


children also showed that a picture most suggestive of aggression was 
less suggestive picture. While 


more predictive of overt aggression than а 
the least suggestive picture did not separate the aggressive from the non- 
aggressive children, more stories of aggressive fighting were produced by 


the aggressive Ss to the picture most suggestive of fighting than by non- 
aggressive Ss. 

However, other studie 
Support for our theoretical 
cards with moderate aggres 


s on the aggressive drive only provide partial 
expectations. Stone (1956) reported that TAT 
sive pull produced the greatest discrimination 
saultive and nonassaultive prisoners. High-aggressive pull 
pictures were not as discriminatory. In keeping with the present expecta- 
tions, however, the moderate drive-relevance pictures were more dis- 
criminatory than the low-relevance pictures. Since the high-aggressive 
pictures (13MF, 18СЕ, and 19BM) have stimulus dimensions that can 
instigate themes other than aggression, the use of these cards leaves 
something to be desired. In a study (Hokanson and Gordon, 1958) 
involving 40 college students, half of the Ss were classified as high 
expressors of aggression and the other half as low expressors on the basis 
of a manifest hostility questionnaire. Within each group, half of the Ss 

listening to a сазе history of a sadistic 


used to hostility by 
arousal, all Ss told stories to a TAT 


juvenile delinquent. Subsequent to I, all 
card low in aggression (card 2) and one high in aggression (card 8BM). 


When the Ss were aroused, the low expressors showed more thematic 
hostility than the high expressors. With respect to cue relevance, the only 
significant finding was that the high card produced more thematic 
an the low card for both groups. Saltz and Epstein (1963) 
college volunteers into three experiment designs on the 
port measures of hostility, guilt, and conflict over hostility. 
composed of 20 55, each of whom scored at the 
All Ss were administered six TAT- 


between a 


were aro 


aggression th 
divided male 
basis of self-re] 
The three groups were © 


extremes on each of these measures. 
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type pictures, two of low relevance to í nec adel, 
aggression, and four buffer pictures. In terms of their conflict bs ie 
Saltz and Epstein predicted that (a) hostile responses to прн. 
pictures are largely due to drive and hostile responses to high-re хе es 
pictures are due to guilt, and (b) conflict over hostility will cav high- 
overresponse to low.relevance pictures and an underresponse 07 кн 
relevance pictures, The data offered equivocal support for their ally, 
the second prediction. More ree 

t the first part of their prediction (a) ы 
confirmed by the findings of a positive relationship between the m 
tionnaire measure of hostility and thematic hostility on the two p "n 
of low aggressive relevance. Since high relevance was only n is 
one picture, the resulting error variance in the thematic score may a 
prevented any correlation between the high card and oe 
hostility. The second part of their first hypothesis (that guilt is heit 
reflected by high-relevance pictures) was not supported by two of th 
three analyses of the questionnaire measures, 


" з "ance to 
aggression, one of high relevanc 


2, Drive-Stimulus Relevance and Sex 


The preceding pattern of results 
are very similar to those found by Lei 
sex drive. In this study, 
administered four T 


; 
in the Saltz and Epstein pr 
man and Epstein (1961) on m 
66 unmarried university students were 
AT-like pictures of low. 


2 Sexual drive was b 


ata showed a positi 
SCX scores and the rate measure; 


between thematic sexy 
low-relevance 
and high-relevanc 
confirmed. Howe 
provide some sy 
authors th shown by an Overresponse to low- 
relevance pictures 


ver, a post hoc an 


and Epstein study on aggression. 


8, Drive-Stimulys Relevance and Fear 


The validity of Epstein’s hypotheses about conflict effects, th 


is best measured by Pictures of low relevance and th 
by an overresponse to low-relevance cues and 


at drive 
at conflict is shown 
an underresponse to high- 
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relevance cues, is cast into further doubt by the two studies on parachuting 
(Fenz and Epstein, 1962; Epstein and Fenz, 1962) previously described. 
It will be recalled that in both studies novice parachutists were given a 
word association test and a thematic apperception test on the day of a 
jump and also on a control day. In the study (Fenz and Epstein, 1962) 
dealing with thematic pictures, the results showed that parachutists on 
the day of the jump manifested significantly more thematic approach 
responses to the high-relevance pictures than to the low-relevance pictures. 
Such a finding casts into doubt the notion that the gradient of avoidance 
is steeper than that of approach, a pivotal assumption in the conflict 
model. It should be noted that the obtained finding on the high-relevance 
picture supports the present author's formulation of stimulus relevance. 
In the word association study, Epstein and Fenz (1962) found their 
prediction that parachutists would give more parachute-relevant re- 


sponses to stimulus words of low relevance and fewer such responses to 


stimuli of high relevance was not upheld, The parachutists on the day of 
nses to all levels of stimulus 


the jump showed similar approach respo 

relevance. In terms of their model, Epstein and Fenz also predicted that 
the number of perceptual misrecognitions of the stimulus words increase 
аз a function of increasing stimulus relevance to parachuting. However, 
the opposite effect occurred; that is, on the day of the jump the parachut- 
ists showed a decrease in misperceptions as stimulus relevance to parachut- 
ing increased from low to high. This finding offers strong support for the 
present author's second assumption. In essence, it follows [rom this 
assumption that increases in drive will facilitate veridical perception for 


stimuli related to the drive. 
r Drives 


Three other published studies also bear on the interaction effect 
between drive level and stimulus relevance. In examining the effects of 
about 86 hours' sleep deprivation on TAT stories, Murray (1959) found 
that the sleep-deprived group showed less thematic sleep content than did 
the control group and that they had significantly fewer sleep themes than 
the controls on a high sleep-suggestive card. The two groups did not 
differ in sleep themes to the low- and medium-sleep card. The experiment 
raises the general issue of whether sleep deprivation induces a drive. To 


this author, it seems more plausible to suggest that sleep deprivation will 
: f a person's arousal level. If this is the case, 


result in a general lowering 0 е : 
cted in terms of our fourth assumption. In 


Murray's results may be expe _ : 
een the hunger drive and stimulus 


an early study оп the interaction betw | i 
relevance, Epstein and Smith (1956) found that pictures of low stimulus 


relevance to hunger were better reflectors of the hunger drive than either 


4. Drive-Stimulus Relevance and Othe 
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a medium or a high picture- 
was used at each of the hi 
indicated, any 
unreliable. 
Murstein (1963a) reported an experiment in which he tested an 
important part of Atkinson’s (1958) theory of human motivation. In 
terms of this theory it may be predicted that, if the motive to achieve 
and the motive to avoid failure are held constant, achievement motivation 
will vary with the probability of success, with the highest possible 


motivation Occurring when the probability is .5. In testing this predic: 
tion, Murstein found that a .5 


pull card. Unfortunately, only one E 
gh and moderate picture-pull levels. As ipe 
findings based upon a single picture are likely to be 


act 
relevance was the ur 
А senifi: 
"ance to achievement produced the sign! 


5 and .1 group. If it can be safely assume! 
that the .5 group has a greater need to achieve than the .1 group, then 


such a finding is not in keeping with the present author’s formulation of 

H H H E d 
stimulus relevance. After reviewing the studies concerned with Atkinson $ 
theory, Murstein (1963b) has concluded, however, that the empirical 


evidence in support of the assumption that a 5 group has the highest 
achievement motivation is "rather fragile." 


Summary. Taken together, 
action effect between stimulus relevance and driv 
tions of the eighth postulate? The inv 
(1956), and Fenz and Epstein (1962) 
Most of the other studies offer mod 
is no support for the Epstein and 


do the studies investigating the inter- 


€ support the predic 
estigations of Wurtz (1960), Карап 
offer strong support for the model- 
lerate support for the postulate. There 


Fenz hypothesis that conflict will cause 
an overresponse to low-relevance pictures and an underresponse to high- 
relevance pictures. Finally, their hypothesis that ewe pictures 
are the best measures of drive js seen to be most suspect Alien м views 
it against all the studies which have been review 


ed in this section. 


I. POSTULATE NINE: AROUSAL LEVEL 


AS A FUNCTION ор AMBIGUITY 
For complex projective material, arosual level vari 


function of increasing ambiguity. 

It will be recalled that a distinction 
techniques based upon the cumulative h 
constructed on the pure relative frequenc 
TAT are the classic illustrations of the latter model, 
jective material, we mean tests based on the pur 
model. Before discussing the ninth postulate, it will b 
the methodological approaches to the concept of ambiguity, 


was made between projective 
omogeneity model and those 
Y model. The Rorschach and 
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1. Definition of Ambiguity 


Broadly speaking, there are two procedural techniques for defining 
the meaning of ambiguity. One may define a scale of increasing ambiguity 
in terms of increasing physical impoverishment. Some of the procedures 
that may be used to induce this kind of ambiguity would be reduction in 
time of exposure, photographic blurring, random or specific removal of 
stimuli, incomplete line tracings, reduction in illumination, and the like. 
Without any wish to imply that impoverishing techniques do not produce 
interesting experimental results, it is proposed that such operations in 
no way ensure that ambiguity has been manipulated. To this author, it 
does not seem to make sense to talk of an ambiguous physical stimulus. 
A stimulus is only ambiguous if it evokes different perceptual or decoding 
reactions. Impoverishment of the stimulus does not ensure that there will 
be an increase in the number of decoding reactions. In fact, their number 
might actually decrease under some conditions of physical impoverish- 
ment. 

As this author (Kenny, 1961) proposed elsewhere, an adequate 
definition of ambiguity must be in terms of decoding reactions. A good 
quantitative index of ambiguity would take into account the number of 
alternative decoding reactions to a stimulus and the proportion of cases 
responding with any given decoding system. This author has proposed 
that the Ё measure of uncertainty from information theory takes into 
account these two factors. If we let p(i) indicate the proportion of Ss 


f the different decoding reactions to a stimulus, then H 


Who give each o 

is given by 

й = —xp()log: P() 

For example, if a projective stimulus elicits four different. decoding 

reactions, with the proportion of individuals giving these reactions being 

-96, .02, .01, and .01, then А is equal to .2557. As Н increases, perceptual 
e ases. The original article (Kenny, 1961) 


or decoding ambiguity incre I 1 
a more extensive discussion and rationale of 


should be consulted for : : ‘ аш! ‹ 
the Н measure, as might Murstein's (1963b) interesting application of it. 
d des 


ibiguity and Arousal Level 


The assumption is made that increasing ambiguity heightens the 
dividual and that Ss attempt to hold to intermediate 


arousal level of the in я н $m 
iding high arousal levels. The additional opera- 


levels of arousal by avo! ion 
made that arousal level can be roughly indicated by 


tional assumption 15 е | 
ratings on Osgood's (Osgood, Suci, and Tannenbaum, 1957) connotative 
activity and that high arousal levels will be judged 


factors of potency and : | a 
as undesirable, 2 indicated by low ratings on the evaluative factor. In 


2. Am 
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i Sela wi ; and 
other words, ambiguity should correlate positively with potency 


Я Э : ; ӨЕ : con- 
activity and negatively with evaluation. This theorizing should be 
wasted with the often made assumption that 


ambiguity facilitates the 
expression of covert layers of 


personality. Any golden nuggets pun 
1954, 1961; Kenny and Bijou, 1953; Bijou and Kenny, 1951; Lipid у 
1958, 1963b; Gurel and Ullman, 1958; Laskowitz, 1968; her m 
1950a,b; Weisskopf-Joelson and Lynn, 1953) that such an шр 
had in the early days of projective testing have been mined and it 1509 
desirable to move on to a finer-grain analysis of psychological functioning 
on projective tests. The present postulate attempts such an analysis. us 

No published research on this postulate has yet appeared. agre 
this author and his associates (Barbara Long, Lynn Harvey, ges 
Moriya, and H. Kuechler) have accumulated a large amount of data © 
the H measure and its correlates, which is being prepared for publicatio! А 
While this is not the place to describe the details of the research, it ien 
be in order to mention that the first phase of the research required various 
groups of 5s to examine 26 TAT cards for a 20-second period and to 
describe what they saw. On the basis of these perceptual statements 
decoding categories were formed. In Order to obtain an index of rd 
biguity for each card, the uncertainty measure has been applied to this 
data. The second phase of the research has sought to determine rs 
correlates of the uncertainty measure, Only one finding of this researc?! 
will be reported, that bearing on the present postulates. 

Some unpublished data, collected by Atsuko Moriya under the 
author's guidance, offer confirmation of the postulate. Male and female 


university students served as Ss for the derivation of two Ё indices fo! 
each of 26 TAT cards, one for e 


and 
ach of the sexes, Two other male anc 
female groups rated each of the pictures on 9 bipolar concepts, three 
concepts for each of the three con 


Notative factors, In order to investigate 
the predicted relationships, the uncertainty measure rank order of the 


26 cards was correlated with each of the three rank orders of the cards 
based on the three semantic factors. For males, the rank order correla 
tions between ambiguity and evaluation, activity and potency, were — Ab, 
61, and .58, respectively. For females the Corresponding correlations меге 
—.52, .40, and .60, respectively. AI] six correlations аге statistically sig 
nificant at less than the .05 level. These findings strongly indicate that 
ambiguity is correlated negatively with the evaluative factor and posi- 
tively with the «Симу апі potency, factors, thereby supporting the 
postulate that ambiguity is arousal raising. However, д great deal more 
research is required before the postulate js Viewed as anything other 
than a tentative formulation. 

With some evidence in support of the notion that ambiguity is 
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correlated with approximate indicators of arousal level, the next research 
question under investigation is, “How does arousal level interact with 
the existing personality structure within the individual?” An adequate 
conceptual and empirical answer to this most challenging question would 
take us far beyond the scope of this paper, which deals with stimulus 


functions in projective techniques. 


Ш. Postscript 


Traditionally the focus of projective techniques has been personality 
centered, with only a token recognition of the fact that the stimulus might 
have something to do with behavior. Throughout the review, an attempt 
has been made to come to grips with the stimulus problem. In order to 
make an interpretive pathway through a maze of diverse findings, a set 
of postulates as to how the stimulus articulates with personality func- 
tioning is proposed. Lest these postulates and the theorizing back of 
them cause premature closure, it should be stressed that the present 
formulation is presented in a tentative and cautious manner. A margin of 
error in the postulates is expected. It is hoped that they are worded in a 
sufficiently precise manner to be empirically tested. An extension of the 
present set of postulates into personality functioning is planned. 

Much of the theorizing is not, of course, wholly original. In par- 
ticular, an attempt has been made to establish a point of contact with 
recent developments in the perceptual and cognitive areas. Furthermore, 
it is hoped that the bridge offered between ambiguity and arousal will 
prove fruitful. 

One final statement may be in order. If projective techniques are to 
projective personologists must maintain contact 
ling knowledge in many areas of psychology, 
on, learning, motivation, and measurement 
and interpretations in 


survive in psychology, 
with the rapidly expanc 
especially perception, cogniti 
theory, and modify their traditional instruments 


light of new information. 
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